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(54) Method of preparing normalized and/or subtracted cDNA 



(57) A method of preparing normalized and/or sub- 
tracted cDNA; a method in which the cDNA that is nor- 
malized and/or subtracted is in the form of uncloned cD- 
NA (cDNA tester); a method of preparing normalized 
and/or subtracted cDN A comprising the steps of: (a) 

preparing cDNA tester; (b) preparing normaliza- 
tion and/or subtraction RNA driver; (c) conducting nor- 
malization and/or subtraction in two steps in any order, 
or conducting normalization/subtraction as a single step 
and mixing the normalization/subtraction RNA driver 



with said cDNA tester; (d) adding an enzyme capable of 
cleaving single strand sites on RNA drivers nonspecifi- 
cally bound to cDNA tester; (e) removing said single 
strand RNA driver cleaved in step d) from the tester and 
removing tester/driver hybrids; and (f) recovering the 
normalized and/or subtracted cDNA; and a method of 
efficiently preparing normalized and/or subtracted (ong- 
chain, full-coding, and full-length cDNA libraries are pro- 
vided. 
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Description 
Technical Field 



[0001] The present Invention relates to an improved method of preparing normalized and/or subtracted cDNAs or 

[0002] hb ThTpresen^ invention also relates to a method for improvement of the normalization and subtraction steps 
by eliminating non-specifically bound hybrids. 

Background of the art 

r00031 Methods of preparing cDNA libraries have been disclosed and are well known in prior art. For example, they 
are described by Ederly I., et al., 1995, Mol Cell Biol, 15:3363-3371 ; Kato S., et al., 1994, Gene, 150:243-250; and K. 
Maruvamaet al., 1995, Gene, 138:171-174. 

pE in Prior art. Carninci et al., 1996, Genomics 37:327-336; Carninci et al.. 1997. DMA Research, 4:61 -66, and 
Carninci and Hayashizaki, 1 999, Methods Enzymol, 303: 1-44, describe efficient methods forthe Preparation of cDNAs. 
These methods, comprising a modified "tagged cap trapper" to se.ect long-strand, full-coding « n ^ or ^ ul| -' e M n 9^ r ^ 
libraries after tagging of the cap structure, allow the preparation of long, full-coding and/or full-length cDNA l.branes 
containing all of a particular coding sequence and its 3' and S untranslated regions (UTRs) Such libraries are partic- 
ularly useful for large-scale sequencing projects in which the recovery of long, full-coding and/or full-length (full-coding/ 
lenoth) clones is required from among truncated clones (EST sequences). 

[00051 However, the preparation of long, full-coding/tength cDNA libraries entails certain problems. The preparation 
of lone, or full-coding/length cDNA is more efficient for short-strand mRNAs than for long-strand mRNAs (transcnpts). 
In addition cloning and amplification is more difficult for long-strand cDNAs than for short-strand cDNAs. further mtro- 
ducinq size bias Using truncated cDNAs to recover full-length cognates is impractical at the genomic-scale level; 
however, cDNAs in a standard library can be cloned in either their long, full-coding/length or truncated form, thus 
favorinq discovery of at least one EST for any gene, regardless of length. • 
[0006] Another problem relates to the nature of cellular mRNA. mRNA can be classified into superprevalent (or 
abundant), intermediate and rare mRNA based on expression. In a typical cell, 5-10 species of superprevalent mRNA 
comprise at least 20 percent of the amount of mRNA, 500 to 2.000 species of intermediately expressed mRNA compnse 
40 to 60 percent of the amount of mRNA, and 10,000 to 20,000 rare species of mRNA compnse less than 20 to 40 
percent of the amount of mRNA. This average distribution may vary markedly between tissue sources, and the presence 
of numerous highly expressed genes may further alter this distribution. SequencingcDNAfrom standard cDNA l.branes 
is ineffective to discover rarefy expressed genes because intermediately and highly expressed cDNA ends up be.ng 

ro007] S,V ^ncrmost C mRNAs of the superprelevant and intermediate frequency classes have been identified, redun- 
dancy levels are expected to exceed 60 percent. Thus, the use of a hybridization normalization method has been 
proposed to solve this problem. The principle behind normalization is to decrease the frequency of the most abundant 
clones while increasing the frequency of less prevalent cDNAs. Several methods of normalizat.on for the preparation 
of EST cDNAs are introduced by Scares et al.. 1994. Proc. Natl. Acad. Sci. 91:9228-9232. who has disclosed a nor- 
malization method for preparing EST sequences. This method is based on the reassociation of nucleic acids cloned 
in amplified plasmid libraries. However, amplified plasmid libraries subjected to normalization are not useful or the 
preparation of long-strand, full^oding/length cDNAs. This is because there is a cloning bias associated wrth plasmid 
libraries where short-strand cDNAs are efficiently cloned with cloning efficiency decreasing with the length of the strand. 
In fact in Scares et al.. 1 994. DNA must be cloned into a plasmid and then be converted to tester single-strand DNA^ 
The ligation to plasmlds reduces the strand length of the cDNA that is recovered (that is. long strands of cDN A tend 

[OOMrAddrtionalty. during library amplification prior to normalization, the ease with which cDNA clones are grown 
varies with plasmid length. Thus, long-strand, full-coding/length clones tend to be underrepresented following bulk 
amplification of the library. In amplified plasmid libraries, the recovery of full-coding/length clones becomes even more 



[00091 ' Other literature, such as Tanaka et al.. 1 996, Genomics, 35:231-235, discloses methods for the preparation 
of EST sequences in which mRNA is first annealed on oligo-dT conjugated on a solid matrix. This method is not suitable 
for preparing normalized long-strand, full-coding/length cDNAs because of mRNA degradation before cDNA synthesis. 
Further the hybridization rate of nucleic acids immobilized on a solid phase is slower than that in solution hybnd.zat.on. 
[0010] ' Libraries created with PCR- and solid matrix-based normalization technologies known in the art exhibit se- 
quence redundancy similar to that of non-normalized cDNA libraries used in EST projects. 

[0011] ■ An additional problem consists in that in the preparation cDNA libraries or encyclopedias (for example, a 
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mammal full-length cDNA encyclopedia) with the aim of collecting at least one long-strand, full-coding/length cDNA 
for each gene expressed irrespective of the tissue source, not only is it desirable to remove cDNAs that are redundant 
within the library, but also cDNAs that have already appeared in previous libraries, so as to accelerate the discovery 
of new long-strand, fujl-coding/length cDNAs. 

[0012] To solve this problem, hybridization subtraction methods have been proposed. 

[0013] Sagerstrorn et al., Annu. Rev. Biochem., 1997, 66:751-83, gives an overview of the subtraction methods 
known in the art. The basic idea of subtraction is that the nucleic acid from which one wants to isolate differentially 
expressed sequences (the tracer or tester) is hybridized to complementary nucleic acid that is believed to lack se- 
quences of interest (drivers) and in which the drivers are present in much higher concentration than thetesters. The 
tester and driver nucleic acid populations are allowed to hybridize, with only sequences common to the two populations 
forming hybrids. After hybridization, driver-tester hybrids and unhybridized drivers are removed, and the remaining 
nucleic acids can be used to prepare a library rich in tester-specific clones or to make probes that can be used to 
screen a library for tester-specific clones. 

[0014] However, subtraction methodsalso entail the same problems described for normalization with PCR- and solid 
matrix-based technologies. They are suited to the preparation of EST sequences, but cannot be used to prepare long- 
strand, full-coding/length cDNAs. 
[0015] Bonaldo et al., 1 996, Genome Reseach, 6:791 -806, discloses a subtractive hybridization approach specifically 
applied to reducing the expression of pools of already sequenced clones from normalized libraries yet to be surveyed. 
[0016] This normalization and subtraction technique (Bonaldo et al. 1996) is useful for large-scale gene discovery 
20 in EST research, but has the drawbacks already indicated in prior art (cDNA cloned in amplified plasmid as disclosed 
by Soares et al., 1994) and is not suited to long-strand and full- coding/length cDNA inserts. 

[0017] Inparticular, as stated above, during library amplification prior to the normalization and subtraction steps, the . 
amplification of cDNA clones varies with plasmid length, with long clones being underrepresented following bulk am- 
plification of the library. That is, the relative expression of long-strand cDNA clones decreases, rendering such cloning 

25 difficult. 

[0018] A further problem of the normalization and subtraction method disclosed by Bonaldo et al. is that both the 
normalization and subtraction steps require incubation and an incubation period causing the breakup of plasmids - 
bigger plasmids (containing long-strand cDNAs) in particular. As a consequence of the normalization and subtraction 
steps, the number of resulting long clones is very limited ornull. 
30 [001 9] This also confirms the unsuitability of this method to the preparation of normalized and subtracted long-strand, 
full-cloning/length cDNAs. 

[0020] A still further problem relating to normalization and/or subtraction methods is that non-specificaily-bound test- 
er/driver hybrids form in these steps due to complementary binding of imperfect sequences. The removal of such 
hybrids would result in the elimination from the tester of targeted cDNAs erroneously considered to be abundant and/ 
or to have already been sequenced in other libraries, but which in reality are not abundant and have not been previously 
sequenced. 

[0021] Accordingly, the purpose of the present invention is to solve the several problems of prior art and to provide 
an efficient method for the preparation of normalized arid/or subtracted long-strand and full-coding/length cDNA librar- 
ies. 



35 



40 



Summary of the invention 



[0022] The present invention provides a method capable not only of normalizing cDNA, but also of subtracting cDNAs 
that have already appeared in other libraries. Accordingly, the present Invention provides an efficient method for the 

45 preparation of normalized and/or subtracted cDNAs, preferably, long-strand and/or full- coding/length cDNA or cDNA 
libraries. Based on this method, the problems of PCR- and solid matrix-based techniques are solved. 
[0023] Accordingly, an embodiment of the present invention relates to a method of preparing normalized and/or 
subtracted cDNA, characterized in that the normalized and/or substracted cDNA is reverse transcript of mRNA and 
cDNA that has not been cloned (hereinafter referred to as cDNA tester). This avoids the drawbacks of the state of the 

50 art caused by cloned cDNA tester amplified in plasmid. 
[0024] This method comprising the following steps: 

I) preparing uncloned cDNA tester preferably a cDNA tester not cloned in plasmid; 

II) preparing polynucleotide drivers for normalization and/or subtraction; 

$ 5 in) normalizing and/or subtracting (one, two, or more steps), removing the tester/driver hybrids obtained by nor- 

malization and/or subtraction, and removing unhybridized polynucleotide drivers; and 
IV) recovering the normalized and/or subtracted cONA. 



3 



EP1 197 552 A2 



The DNA tester of step I) is preferably long-strand, full- coding/length cDNA. 

[0025] Further the method of the present invention may comprise the step of V) by preparing a second strand cDNA 
complementary to the normalized and/or subtracted cDNA and cloning the double-strand cDNA that is recovered. 
[0026] According to another embodiment, the present invention relates to a method of preparing cDNA, preferably 
5 long-strand, full-coding/length cDNA. wherein the normalization and subtraction drivers are mixed together and nor- 
malization and subtraction are performed in a single step (normalization/subtraction). 

[0027] According to a further embodiment, a method of improving normalization and/or subtraction by eliminating 
RNA (driver) bound non-specifically to cDNA (tester) by treating the non-specific binding RNA/DNA hybrid with an 
enzymecapable of severing single-strand sites in RNA drivers is provided. This enzyme can be a nuclease, in particular, 
10 a ribonuclease capable of cleaving single-strand RNA or a mixture thereof. Preferably RNase I (also indicated as 
RNase 1) can be employed. 

[0028] However, this treatment is not limited to the normalization and subtraction of hybrids in the method of preparing 
cDNA, but can be used to cleave sites in RNA that has become partly single-stranded as a result of nonspecific binding 
and thus selectively remove nonspecifically bound RNA/DNA hybrids in any species of nonspecifically-bound RNA/ 
15 DNA hybrid. 

[0029] Accordingly, a method of preparing single-strand and/or double-strand cDN A by treating nonspecifically bound 
RNA/DNA hybrids with an enzymecapable of cleaving single-strand RNA (capable of cleaving sites in RNA that has 
become partly single-stranded), removing the cleaved RNA, and recovering the cDNA is provided. 
[0030] The normalized and/or subtracted cDNA prepared by any of the methods of the present invention may be 
20 single-strand or double-strand cDNA. 

Brief Description of the Drawings 

[0031] Fig. 1 is a schematic of a suitable normalized and/or subtracted cDNA preparation protocol. A) A general 
25 scheme for the preparation of long-strand, full- coding/length single-strand cDNA; B) Representation of a population 
of different tester cDNAs; C) normalizing drivers (cellular mRNA) and subtracting drivers (run-off transcripts); D) hy- 
bridization; E) rare/new cDNAs are used for second strand cDNA preparation (normalized/subtracted cDNA library); 
F) abundant cDNAs/unwanted cDNAs are removed and may be used for the preparation of minilibraries to implement 
subtraction. 

30 [0032] Fig. 2 shows (on the right side of the drawing) the use of RNase I able to recognize and cleave RNA (driver) 
bound non-specifically to cDNA (tester). With this method, new and/or rare cDNAs are recovered. In contrast, as shown 
on the left side of the figure, when no RNase I treatment is performed, the new and/or rare tester bound non-specifically 
to the driver is captured by beads and removed. 

[0033] Fig. 3 shows on the right side an electrophoretic pattern of pancreas cDNA normalization/subtraction per- 
35 formed in one single step compared with a run of pancrease cDNA that has not been normalized/subtracted. This 
example of normalization/subtraction provides a visual image of the removal of highly abundant full-length cDNAs, 
which are visible in the example of cDNA that has not been normalized/subtracted (indicated by arrow). 
[0034] Fig. 4 shows results of plaque hybridization of replicas for the genes EF-1 alpha, carbonyl-reductase and 
uteroglobin containing a standard (not normalized and not subtracted) length cDNA library (left) or a normalized length 
40 cDNA library (right). On the right panel (normalized), the arrow indicates the plaques that have been counted. The 
number of plaques (clones) counted for the normalized library is sensitively lower than that for the standard library. 
[0035] Fig. 5 shows results in which the increase in sequencing redundancy (or decrease in new gene discovery) 
increases sharply in standard cDNA libraries (-000 libraries), but in normalized/subtracted full-length cDNA libraries (- 
1 00 libraries) the increase in redundancy is quite slow. New genes (percent) are indicated as singletons (percent) within 
45 a given cDNA library. 

[0036] Fig. 6 shows an electrophoresis test for evaluating the efficiency of removal of driver/tester captured using a 
subtraction method based on the present invention. 

[0037] Only one clone (cDNA) tester was used and it was subtracted with a driver made with the same cDNA. 
[0038] Lanes 1-3 show subtracted cDNA when biotinylation was performed in ice, and lanes 4-6 when performed at 
so room temperature (RTT). Lanes 7-9 comprise tester and driver without bead treatment (therefore not removed) at 1 00 
percent, 1 0 percent and 2 percent, respectively. 

[0039] Lane 9 shows that a quantity of 2 percent, even if low, is still evident. In lanes 1 -6, there is no evident presence 
of tester/driver hybrid, indicating that subtraction was almost 100 percent. 

55 Detailed Description of the Invention 

[0040] According to an embodiment of the present invention, an efficient method for the preparation of normalized 
and/or subtracted cDNA or cDNA libraries, preferably long, and/or full- coding/length cDNA libraries, is provided by 
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preparing uncloned cDNA as tester. The uncloned cDNA tester may be cDNA not cloned in plasmid. 
[0041] This method does not require the PCR cloning step and the binding of oligo-dT to solid matrix before the 
normalization and/or subtraction step/s (one, two, or more steps). As a result, long and/or full- coding/length cDNAs 
are recovered. 

5 [0042] Accordingly, the method of the present invention may comprise only the normalization step, only the subtrac- 
tion step, both the normalization and subtraction steps in any order, or normalization and subtraction performed in a 
single step. 

[0043] These normalized and/or subtracted cDNAs are treated to synthesize a complementary second strand when 
they are single-strand, and finally cloned. 
10 [0044] Accordingly, this method comprises the steps of: 

I) preparing an uncloned cDNA (tester); J I) preparing, polynucleotide (driver) for normalization and/or subtraction; 

III) performing the normalization and/or the subtraction step/s (one, two, or more steps), and removing the tester/driver 
hybrids obtained by normalization and/or subtraction and the non-hybridized polynucleotide drivers; and 

IV) recovering normalized and/or subtracted cDNAv(rare and/or new cDNAs). 

* 5 [0045] The method based on the present invention further comprises, when the cDN A tester is single-strand, a step 

V) : preparing the second cDNA strand and cloning. 

[0046] The above steps j)-V) can be repeated several times (two-times, three-times, etc.) according to the conven-r 
ience. 

The uncloned cDNA tester of step I) may be cDNA not cloned in plasmid. The cDNA tester of step I) may be a reverse 
20 transcript of mRNA in the form of an uncloned cDNA. The uncloned cDNA of step l) is preferable a long and/or full- 
codihg/length cDNA tester. 

[0047] For the purposes of the present invention, the term "full-length cDNA" is used to denote 5' and 3' UTR se- 
quences and T-primer oligonucleotides (that are complementary to mRNA comprising poly-A). "Full-coding cDNA" 
means a cDNA sequence comprising at least a start codon and a stop codon. And by "long-strand cDNA" is understood 
a cDNA sequence which is almost full-coding and/or full-length but lacks one or more bases at the 3* end (corresponding 
to the 5* end of mRNA) or at the 5' end if considering a cDNA strand complementary to cDNA that is complementary 
to mRNA (that is, cDNA having the same direction as the gene). This early stop (before reaching the 5* end) can be 
caused by the formation of a secondary structure in the mRNA at the Cap structure level, impeding synthesis of cDNA. 
[0048] The cDNA tester according to the invention may be a single or double strand. In case of double strand, it is 
30 treated according to the technique known in the art allowing dissociation of double strand cDNA in single strand and, 
in presence of RNA driver, allowing the formation of hybrid cDNA/RN A. An example of this technique comprises mixing 
the tester double strand cDNA and RNA driver in presence of formamide at a concentration higher than 50%, preferably 
80%, at temperature 40-60°C, preferably 50°C, in usual hybridization buffer). 

[0049] The cDNA tester of step I) can be prepared by any method of cDNA preparation known in prior art, preferably 
35 any known method for the preparation of long-strand and/or full-coding and full-length cDNAs. Examples are Ederly 
et al.; Kato et al.; and Maruyama et al.(the oligo-capping method). . 

[0050] In particular, the pligo-capping method is a method where phosphoric esters of incomplete cDNAs without 5' 
Caps are removed with alkaline phosphatase and then all cDNAs are treated with the tobacco acid pyrophosphatase 
(TAP) used as decapping enzyme so that only full-length cDNAs have phosphates. 
*o [0051] Preferably, a method employing the CAP-trapping technique described by Carninci et al., 1996, Genomics 
37:327-336, and Carninci and Hayashizaki, 1999, Methods Enzymol, 303: 1-44, is used. 
[0052] An example of this method is schematically illustrated in section A of Figure 1 . 

[0053] mRNA is isolated from tissue and RNA-DNA hybrids are made by reverse transcriptase starting from primers 
such as oligo-dT or random or specific primer-adapters, using mRNA as template. A tag molecule is then chemically 
bound to the diol structure of the 5' CAP ( 7Me Gpp p N) site of the mRNA forming the hybrid. Finally, RNA-DNA hybrids 
carrying a DNA corresponding to long, full-coding/length mRNA are separated from hybrids carrying tag molecules by 
binding the tag molecules. 

[0054] Tag molecules that are bound to the Cap after formation of the RNA-DNA hybrids are particularly advanta- 
geous because the hybrid structure of RNA-DNA can escape chemical cleavage of mRNA during the aldehydratton of 
50 the diol structure necessary for labelling mRNAs with tag molecules. As a result the efficiency of full-coding/length 
cDNA synthesis increases. 

[0055] The tag molecule can be bound to the 5 1 Cap site by, for example, an oxidation ring-opening reaction of the 
5* Cap site diol structure with an oxidizing agent such as sodium periodate to form a dialdehyde and subsequent 
reaction of the dialdehyde with a tag molecule having a hydrazine terminal. 
55 [0056] Examples of tag molecules having hydrazine terminals are biotin, avidirt and streptavjdin, as well as digoxi- 
genin molecules having hydrazine terminals. A molecule showing reaction specificity, such as antigens and antibodies, 
can also be used as tag molecule. The label molecule employed as tag molecule is not specifically limited. 
[0057] Accordingly, the preparation of cDNA used to practice any of the methods based on the present invention 



45 



5 



EP 1 197 552 A2 



comprises the steps of: 

(1) synthesizing a first strand of cDNA by reverse transcriptase forming the hybrid mRNA/cDNA; 

(2) chemically binding a tag molecule to the diol structure of the 5' CAP ( 7Me G ppp N) site of the mRN A forming the 
5 hybrid; 

(3) capturing long, full-coding, and full-length cDNA hybrids; and 

(4) removing single-strand mRNA by digestion with an enzyme (preferably with RNase H) that is capable of cleaving 
single strand mRNA or by using an alkali (preferably NaOH). 

10 [0058] A more specific example of a method for preparing cDNA including steps from 

(1) synthesizing first cDNA strands to (7) synthesizing double-stranded full- coding/length cDNA (having for ex- 
ample biotin as tag molecule) is as follows: 

is (1) synthesis of first strand cDNA (synthesis of an RNA-DNA hybrid); 

(2) biotinylation of an mRNA of the RNA-DNA hybrid; 

(3) ribonuclease I (RNase I) digestion; 

(4) capture of a full- coding/length cDNA hybrid (with avidin or streptavidin beads); 

(5) removal of hybrid RNA (RNase H digestion); 

20 (6) G tail addition by terminal deoxynucleotidyl transferase; and 

(7) preparation of second strand (double stranded full- coding/length cDNA) primed with oligo C. 

[0059] Alternatively, step (5) can be performed with an alkali (preferably NaOH). The cDNA prepared in step (6) may 
be used as cDNA tester for the purpose of the present invention. After normalization and/or subtraction, step (7) may 
2s be performed. 

The cDNA obtained by the method of step I) consists of various populations of cDNA, namely superprevalent (or highly 
expressed or class I), intermediate (or class II) and rare cDNAs (class III) as indicated in section B of Figure 1 (inter- 
mediate and superprevalent are indicated together in section B as abundant). Some of these cDN As must be considered 
as having been previously collected in other libraries and are indicated in section B as previously collected cDNA. The 
30 cDNAs that were obtained are indicated in section B as "testers". 

[0060] In step II), "drivers" are prepared for normalization and/or subtraction. 

[0061] Normalization drivers are RNA or DNA obtained from the same tissue as the population, and/or the same 
population, that one intends to eliminate. 

[0062] Normalization drivers can be for example cellular mRN As of the same library, that is, aliquots of the mRNAs 
35 initially used to prepare the cDNA library (starting material mRNA). Normalization drivers can also be cDNA obtained 
from the same library that one intends to normalize. In that case, single-strand cDNAs have to be prepared from the 
cDNA library by PCR, for example. 

[0063] Subtraction drivers are RNA or DNA obtained from different tissue from that which one intends to substract, 
RNA or DNA obtained from the same or a different strain (system) from that which one intends to subtract, or the same 

40 tissue belonging to a different population.from the population that one intends to eliminate. 

[0064] In vitro-transcribed RNA from a DNA library, preferably clones from different tissue or from the same tissue 
but belonging to a different population and prepared with the cap-trapper technology, can be employed in the subtraction 
step. However, subtraction drivers prepared with any method known in the art, for example, as described by Sagerstrom 
et aL, 1997, Annu. Rev. Biochem., 66:751-83, can be used for the purpose of the present invention. 

45 [0065] For example, subtraction drivers can be run-off transcripts from minilibraries containing expressed genes, 
rearrayed clones, and in some cases, previously sequenced (but not necessarily) cDNAs. 

[0066] Subtraction run-off transcripts are obtained by RNA polymerase (for example, T7, T3 , SP6 or K11 RNA 
polymerase) from DNA templates carrying appropriate promoters, such as a DNA sequence flanked with promoters, 
plasmids, phages, and analogs thereof. In the case of plasmids, subtraction transcripts can be prepared by amplifying 
so the cDNA library or rearrayed library with a solid or liquid phase. Preferably, subtraction transcripts can be prepared 
by spotting colonies obtained from well plates (for example 384 well plates) onto LB+ ampicillin-agar plates, growing 
them at a temperature of from 30°C to 37°C, and following overnight growth : scraping the colonies for use in bulk 
plasmid preparation. 

[0067] DNA can also be used for subtraction drivers. In that case, single-strand DNA isolated from clones obtained 
55 from different tissues or from the same tissue but belonging to a different DNA population can be used. 

[0068] Minilibraries are libraries comprising a portion of the clones of the starting tissue or of different tissues. 
[0069] A schematic example of the preparation of drivers for normalization and/or for subtraction is shown in section 
C of Figure 1 . However, methods based on the present invention comprise only the normalization step, only the sub- 
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traction step, the normalization and subtraction steps in any order, or normalization/subtraction in a single step. These 
drivers can be bound with a tag molecule. This (first) tag molecule can be any molecule able to bind or tag the drivers 
and also able to bind to a matrix permitting elimination of the driven The tag molecule can also bind a second tag 
molecule, which can be the same as the first tag molecule or a different molecule, this second tag molecule being able 
to bind to a matrix or to a further tag molecule. Preferred tag molecules are biotin : avidin, strepavidin, digoxigenin, or 
any antibody thereto, preferably an antibiotin, antiavidin, antistrepavidin or or antidigoxigenin antibody, or any antian- 
tigen antibody thereof. However, the tag molecule is hot limited to these substances. 

[0070] Steps III) and IV) can be executed in different order depending'on whether the normalization and subtraction 
steps are carried out consecutively or in a single step. 

[0071] According to a first approach, normalization step (III) is performed by mixing with normalization drivers, fol- 
lowed by elimination of the. hybrids and recovery of the normalized single-strand cDN As (step IV). Next, these single- 
strand cDNAs are mixed with subtractive drivers (step III), the hybrids are eliminated and (normalized and) subtracted 
single strand cDN As are recovered (step IV). The single-strand cDNAs that are recovered consist of rare and new 
cDNAs as exemplified in section D of Fig.1. The normalization and subtraction steps can also be performed in inverted 



[0072] According to another approach, the normalization drivers and the subtractive drivers prepared in step II) are 
mixed together, with a single normalization/subtraction step being performed in step III). 

[0073] The implementation of normaiization/substraction in a single step affords the advantage of performing only 
one incubation step (instead of two). Performing the normalization and subtraction in two different distinct steps entails 
20 the problem of performing two incubation steps and each incubation step tends to reduce the number of long and/or 
full-coding/length cDNAs. Thus, performing only a single incubation step is advantageous. 

[0074] However, the normalization and/or subtraction step and the single normalization/subtraction step can be re- 
peated, if necessary, before final recovery of the rare new cDNAs and synthesis of second strand cDNA. 
[0075] Accordingly, the present invention also discloses a method for the preparation of normalized and subtracted 

25 cDNA comprising the steps of I) preparing a cDNA tester, which may be uncloned according to the first embodiment 
above but riot limited to it, II) preparing polynucleotide normalization driver and polynucleotide subtraction driver (nor- 
malization/subtraction drivers), III) performing normalization and subtraction in one single step, by mixing together the 
tester and the normalization/subtraction drivers, and IV) recovering the normalized and subtracted cDN A. 
[0076] This method may also include the step of adding an enzyme capable of cleaving single-stranded RNA driver 

30 nonspecifically bound to single strand, for example using RNase I, and removing the cleaved single strand RNA driver. 
This step of addition of RNase I or other enzyme capable of cleaving single-stranded RNA is disclosed more in details 
later. 

[0077] The preparation of normalization and/or subtractive drivers, the hybridization step(s) (one or more), and the 
removal of "uninteresting cDNAs", i.e. hybrids produced in the normalization and/or subtraction steps and single drivers 
35 (drivers not forming hybrids) can be accomplished by any technique known to the art. For example, these may be 
conducted as described in Bonaldo et al., 1996, and in Sagerstrom et al. 1997, Annu. Rev. Biochem., 66:751-783 (from 
page 765; also Table 1). 

[0078] As a specific example, the hybridization technique relating to photoactivated biotin, strepavidin binding and 
organic extraction described by Barr F.G. and Beverly S. Emanuel, 1990, Analytical Biochemistry, 186: 369-373 or in 

40 Hazel L. Sive andTom St John, 1 988, Nucleic Acids Research, Vol. 1 6, number 22, from page 1 0,937, can be employed. 
[0079] However, techniques known in the art such as are described by Sagerstrom et al., 1997, may be employed. 
[0080] Following normalization and/or subtraction the tester/driver hybrids are removed by any technique known in 
the art, such as that described by Sagerstrom et al., 1997, from page 765. For example, this may be accomplished by 
the addition of a matrix such as beads, preferably magnetic beads or agarose beads. The beads are preferably covered 

4 * with a tag molecule, as set for above, or bind to tag molecules. Beads covered with streptavidin (generally referred to 
as "streptavidin beads") are preferred, with magnetic porous glass (MPG) streptavidin beads (CPG Inc.) being even 
more preferred. Beads covered with or bonded to avidin, biotin, digoxigenin, an antibody, or an antigen can also be 
used. The antibody covering or bound to the beads (one, two, or more types of beads) can be an antibody generally 
able to recognize tag molecules, preferably an antibody which recognizes antibody bound to the drivers, or an antibiotin 

so antibody, antiavidin antibody, antistreptavidin antibody, or antidigoxigenin antibody which recognizes biotin, avidin, 
streptavidin or digoxigenin bound to the drivers. 

[0081] An example of magnetic beads bound to or covered with biotin as a tag molecule forming a tester/driver 
hybrid-bead aggregate is shown in section F of Fig.1 . 

[0082] Streptavidin or avidin/phenol may be employed instead of magnetic beads to remove the hybrid (Sive H.L. 
55 and St.John T., 1988, Nucleic Acids Res., 16:1 0937; andSagerstrom et al., 1997). 

[0083] It is also possible to employ hydroxyapatite (HAP) and unlabeled RNA to remove tester/driver hybrids. An 
example is described by Sagerstrom et al., page 765 and Table 1 . 

[0084] The removal of tester/driver hybrids by the subtraction method based on the present invention permits almost 
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100 percent removal as seen from the electrophoresis of Figure 6 and from Example 3. 

[0085] The cDNA of the removed tester/driver hybrids can be used for the preparation of cDNA minilibraries to be 
used for further subtractions steps as shown in section F of Fig.1 . 

[0086] The normalized and subtracted cDNAs (rare and new cDNAs) that are recovered in step IV) are then treated 
5 to synthesize second cDNA strands, subjected to restriction digestion, ligated, and cloned into vectors, as schematically 
indicated in section E of Fig.1. 

[0087] The advantage of the method based on the present invention is that it maintains a high proportion of long, 
full- coding/length cDNAs in the subtracted/normalized library. Further, the present method increases the discovery of 
new genes relative to the results obtained by using standard, full-length cDNA libraries prepared based on prior art. 

w [0088] That is, in the method of the present invention, normalized and/or substracted cDNA (tester cDNA) is an . 
uncloned cDNA. Preferably, a cDNA not cloned in a plasmid. Preferably, the tester acccording to the invention is a 
reverse transcript of mRNA in the form of uncloned cDNA. This cDNA is preferably a single-strand. Thus, the prior art 
problem of cloning bias against large cDNAs in plasmid libraries and the problem in libraries generated by normalization * 
techniques based on PCR and solid matrixes can be avoided, and the advantage of increased discovery of new genes 

is is afforded. 

[0089] As stated in the prior art section, a further problem relating to normalization and/or subtraction methods is 
that non-specific tester/driver hybrids form during these steps due to complementarity binding of imperfect sequences. 
For example, this is caused by cross-reactivity between similar but unidentical sequences in testers and drivers. The 
removal of such hybrids eliminates from the tester cDNAs those cDNAs considered to be erroneously abundant and/. 
2° or to have been already sequenced in other libraries, as well as other desirable sequences. This constitutes a major 
drawback to normalized/subtracted libraries. 

[0090] This problem is schematically depicted in Figure 2. On the left side of Fig.2, normalization and/or subtractive 
drivers (mRNA, upper strands) are non-specifically bound (there is a portion of the mRNA that is not bound to the 
cDNA) to cDNAs (lower strands) that are new and/or rare but erroneously believed to be abundant and/or to have been 

25 previously collected. cDNA testers that are bound to drivers, albeit nonspecifically, are removed during normalization 
and/or subtraction as indicated in Fig. 1 . Thus, when the cDNA testers are new and/or rare cDNAs, they are lost. 
[0091] In the method of the present invention, in such nonspecific binding, such as is shown on the right side of Fig. 
2, drivers that are nonspecifically bound to cDNA testers are eliminated by degrading RNA with RNase I or some other 
enzyme described further below and the new and/or rare cDN A is recovered. 

30 [0092] As shown in Fig. 2, one embodiment of the present invention provides a method in which nonspecifically 
binding RNA/DNA hybrids are processed (digested) by an enzyme cleaving single-strand RNA (an enzyme having the 
ability to cleave single-strand sites in RNA drivers nonspecifically bound to cDNA testers) to cleave single-strand sites 
of RNA (drivers) nonspecifically bound to cDNA (testers). Next, hybrids with RNA that has been cleaved from the cDNA 
testers are eliminated (denatured) to remove the cleaved RNA from the system, leaving the cDN A testers and improving 

35 the efficiency of normalization and subtraction (that is, reducing unintended exclusion from the rare cDNA system). 
[0093] There is no limitation to the method of removing RNA fragments from the nonspecifically bound RNA/DNA 
hybrids that have been treated with enzymes cleaving single-strand RNA. For example, as shown in Fig. 2, the hybrids 
may be processed at a suitable temperature to eliminate, that is, denature, hybrids with RNA that has been cleaved 
from cDN A tester, and the RNA freed from the cDN A tester may be removed using tags bound to the RNA, for example. 

40 The elimination of hybrids with RNA that has been cleaved from cDNA tester does not affect specifically bound RNA/ 
DNA hybrids. The conditions for selectively denaturing hybrids that have been cleaved and the strand length of the 
hybrid portion that has been shortened are suitably selected. Denaturing conditions depend not just on temperature, 
but also on the pH of the system (the aqueous solution containing the hybrids), salt concentrations, and the like. De- 
naturing is conducted for example by processing at a temperature of 25-1 00°C (i.e. up to the boiling point), preferably 

45 37-70°C, and still more preferably at a temperature of 65°C. The use of the above-stated temperature denatures hybrids 
of partially cleaved RNA, the RNA previously comprising the hybrids dissociates from the cDNA, the dissociated RNA 
is bound to beads using tags bound to the RNA, and the RNA fragments bound to the beads are removed with a magnet 
by the usual methods. Specifically bound RNA/DNA hybrids are maintained under the above-stated denaturing con- 
ditions, and tags bound to the RNA in the same manner as set forth above may be used for binding to beads and 

so removal. That is, conventional normalization and/or subtraction are employed. 

[0094] A single-strand-specific RNA endonuclease (ribonuclease) can be employed as the above-described enzme 
cleaving single-strand RNA. Examples suitable for use are RNase A specific to pyrimidine (U and C), RNase 4 specific 
to U, RNase T1 specific to G, RNase 2 or RNase 3 specific to U, and RNase I able to degrade any ribonucleotide 
(Hyone-Myong Eun, Chapter for "Nucleases"; Sorrentino Salvatore and Libonati Massimo, 1997, FEBS Letters, 404: 

S5 1 -5: these authors use the arabic numbers in order to indicate the above ribonucleases, however these ribonucleases 
are also indicated with the roman numbers in literature). Alternatively, RNase T2 having little base specificity can be 
used as the enzyme cleaving single-strand RNA (BioTechniques, 232, Vol.12, No.2, 1992). 

[0095] RNase I is employed with preference as the enzyme cleaving single-strand RNA. A mixture of the above- 
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listed ribonucleases may also be employed. Hybrids may be subjected to the action of single-strand-specific RNA 
endonucleases by the usual methods. For example, 0.01 -1 unit of a single-strand-specific RNA endonuclease may be 
employed per 1 \xg of driver. 

[0096] The step of degrading mRNA drivers nonspecifically bound to cDNA testers may be conducted in the normal- 
5 ization/subtraction step based on the present invention, following the normalization and/or substraction step, or after 
a single normalization/substraction step. 

[0097] The cDNA tester according to this method, can be cloned or uncloned cDNA. Uncloned tester can be for 
example a cDNA not cloned in plasmid. The cDNA tester is preferably a reverse transcript of mRNA in form of uncloned 
cDNA. ■ 

w [0098] The cDNA tester is preferably a long-strand, full-coding, and/or full-length cDNA, preferably prepared accord- 
ing to the cap-trapping technology as above described. 

[0099] Accordingly, an aspect of the of the present invention is a method specifically comprising the steps of: 

(a) preparing cDNA testers; 
15 (b) preparing normalization and/or subtraction RNA drivers; 

(c) performing normalization and/or subtraction in two steps in any order, or performing normalization/subtraction 
in a single step by mixing normalization/ subtraction RNA drivers with cDNA testers; 

(d) adding an enzyme having the ability to cleave single strand sites on RNA drivers nonspecifically bound to cDNA 
testers; 

20 (e) removing the cleaved single-strand RNA drivers of step d) from the testers and removing tester/driver hybrids; 

(f) recovering normalized and/or subtracted cDNAs; and 

(g) preparing second strand cDNAs and cloning the recovered cDNAs. 

The processing methods for removing single-strand RNA drivers can be applied beyond the normalization and 
substraction of hybrids in methods of preparing cDNA. For example, they can be used to remove partially single- 
25 strand RNA in all types of nonspecific RNA/DNA hybrids. 

That is, the present invention comprises a method of treating nonspecifically bound RNA/DNA hybrids with 
an enzyme capable of degrading single-strand RNA to remove RNA that is nonspecifically bound to DNA. In this 
method, nonspecifically bound RNA/DNA hybrids are treated with an enzyme capable of degrading single-strand 
RNA to degrade RNA nonspecifically bound to DNA and remove it from mixtures of DNA and/or RNA/DNA hybrids 
30 specifically bound to DNA. This method may be employed with the object ofxecovering DNA nonspecifically bound 

to RNA or with the object of recovering just DNA hybrids specifically bound to RNA. 

In the present method, as set forth above, the enzyme capable of degrading single-strand RNA may either be 
selected from the group consisting of RNase I, RNaseA, RNase4, RNaseTI , RNaseT2, RNase2 t and RNase 3, 
or comprise a mixture thereof. RNase I is the enzyme of preference. 
35 The RNA/DNA hybrids including the above-described nonspecifically bound RNA/DNA hybrids may be the 

products of methods comprising normalization and subtraction steps conducted in any order or the products of 
methods comprising a single step of normalization/subtraction. That is, nonspecifically bound RNA/DNA hybrids 
may be removed from RNA/DNA hybrids obtained by conventional normalization and/or subtraction methods and 
the recovery rate of long-strand or rare cDNA improved. Accordingly, the cDNA forming the above-described non- 
40 specifically bound RNA/DNA hybrids can be long-strand, full-coding and/or full-length cDNA. 

The present invention further discloses a method of isolating single-strand cDN A in which the above-described 
hybrids comprising RNA nonspecifically bound to cDNA is treated with an enzyme capable of degrading single- 
strand RNA, the degraded single-strand RNA is removed, and the DNA is recovered. In this method, the (degraded) 
single-strand RNA that is produced by treatment with an enzyme capable of degrading single-strand RNA is re- 
«5 moved from the system comprising the hybrids, resulting in the recovery of single-strand cDNA. Based on this 

method, cDNA nonspecifically bound to certain RNA may be selectively recovered. 

In this method, in the same manner as above, the enzyme capable of degrading single-strand RNA is either 
selected from among the group consisting of RNase I, RNaseA, RNase4, RNaseTI, RNaseT2, RNase2, and 
RNase3, or is a mixture thereof, with RNase I being preferred. Further, the cDNA may be long-strand, full-coding, 
50 and/or full-length DNA. 

The present invention further comprises a method of preparing normalized and/or subtracted cDNA in which 
an enzyme capable of degrading single-strand RNA driver nonspecifically bound to cDN A tester is added and the 
degraded single-strand RNA driver is removed. In this method, cDNA tester and hybrids comprising single-strand 
RNA driver nonspecifically bound to the cDN A tester are subjected to the action of an enzyme capable of degrading 
55 single-strand RNA driver, and following degradation, the degraded single-strand RNA driver is removed from the 

system comprising the hybrids. 

In this method, in the same manner as above, the enzyme capable of degrading single-strand RNA is either 
selected from among the group consisting of RNase !, RNaseA, RNase4, RNaseTI, RNaseT2, RNase2, and 
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RNase3, or is a mixture thereof, with RNase I being preferred. Further, the cDNA may be long-strand, full-coding, 
and/or full-length DNA. 

The method of the present invention may be employed to prepare one : two, or more cDNA libraries. The 
present invention also covers the cDNA and cDNA libraries obtained by the methods of the present invention. 
5 Finally, the methods according to the present invention permit the following: 

(i) high-efficiency removal of mRNA drivers; 

(ii) no relevant cDNA size reduction following hybridization that would affect the frequency of long, full-coding/ 
length cDNAs; 

(in) suitability to both normalization and subtraction; 
10 (iv) low cross-reactivity between similar but unidentical sequences; and 

(v) high-level performance, reproducibility, and ease of handling in terms of both the size of the drivers prepared 
and the number of libraries. 

[01 00] The methods and embodiments of the present invention will be further described with reference to the following 
15 examples. 

Example 1 

Preparation of RNA 

20 

[0101] Slices of brain tissue (0.5-lg) (or other tissues as described in Example 2) were homogenized in 10 mL of 
solution D (Chomczynski , P. and Sacchi, N., 1987, Annal. Biochem., 162:156-159) and extracted with 1 mL of 2M 
sodium acetate (pH 4.0) and the same amount of a mixture of phenol/chloroform (volume ratio 5:1). After extraction, 
the same volume of isopropanol was added to the aqueous phase to precipitate RNA. This sample was incubated on 

25 ice for an hour and centrifuged at 4000 rpm for 1 5 minutes with cooling to collect the precipitates. The precipitates 
were washed with 70 percent ethanol and dissolved in 8 mL of water. A 2 mL quantity of 5M NaCI and 16 mL of an 
aqueous solution (pH 7.0) comprising 1 percent CTAB (cetyltrimethylammonium bromide), 4M urea, and 50 mM Tris 
were added to precipitate RNA and the polysaccharides were removed (CTAB precipitate). After centrifugation at 4,000 
rpm for 15 minutes at room temperature, the RNA that had been obtained was dissolved in 4 mL of 7M guanidine-CI. 

30 Double the quantity of ethanol was then added to the solution and the mixture was incubated for an hour on ice and 
centrifuged at 4,000 rpm for 1 5 minutes. The resulting precipitates were washed with 70 percent ethanol and collected. 
The precipitates were again dissolved in water and the purity of the RNA was determined by measuring the OD ratio 
260/280 (>1 .8) and 230/260 (<0.45). 

35 Synthesis of cDNA 

[0102] From 5 to 10 u,g of this RNA, 5 u.g of first-strand primer containing the BamHI and Ssfl restriction sites 
(S'-fGAJsAGGATCCAAGAGCTCCO^VN-S') (SEQ ID NO:1), and 11.2 uX of 80 percent glycerol were combined to a 
total volume of 24 uX. The RNA-primer mixture was denatured at 65°C for 1 0 min. Simultaneously, 1 8.2 uX of 5X first-. 

40 strand synthesis buffer, 9.1 uX of 0.1 M DTT, 6.0 uX (each) of 10 mM dTTP, dGTP, dATP, and 5-methyl-dCTP (instead 
of dCTP), 29.6 uX of saturated trehalose (approximately 80 percent, low metal content; Fluka Biochemika), and 10.0 
uX of Superscript It reverse transcriptase (200 U/uX) were combined to a final volume of 76 uX. A 1 .0 uX quantity of 
[alpha- 32 P]dGTP was placed in a third tube. The mRNA, glycerol, and primers were mixed on ice with the solution 
containing the Superscript II, and an aliquot (20 percent) was quickly added to the tube containing the [alpha- 32 P] 

45 dGTP. First-strand cDNA synthesis was performed in a thermocycler with a heated lid (e.g., MJ Research) according 
to the following program: step 1 , 45°C for 2 min; step 2, gradient annealing: cooling to 35°C over 1 min; step 3, complete 
annealing: 35°C for 2 min; step 4, 50°C for 5 min; step 5, increase to 60°C at 0.1 °C per second; step 6, 55°C for 2 
min; step 7, 60°C for 2 min; step 8, return to step 6 and repeat for 1 0 additional cycles. Incorporation of radioactivity 
permitted estimation of the yield of cDNA (Caminci and Hayashizaki, 1999). The cDNA obtained was treated with 

50 proteinase K, extracted with phenol/chloroform and chloroform, and ethanol-precipitated using ammonium acetate as 
the salt (Caminci and Hayashizaki, 1999). 

Biotinylation of mRNA 

55 [0103] Before biotinylation, the diol group of the cap and 3'-end of the mRNA was oxidized in a final volume of 50 
uX of suspended mRN A/first-strand cDNA hybrid, 66 mM sodium acetate (pH 4.5), and 5 mM Nal0 4 . Samples were 
incubated on ice in the dark for 45 min. mRNA/cDNA hybrids were then precipitated by adding 0.5 uX of 1 0 percent 
SDS, 11 uX of 5M NaCI, and 61 uX of isopropanol. After incubation in the dark on ice for 45 min or at -20°C or -80°C 
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for 30 min, the samples were centrifuged for 1 0 min at 1 5,000 rpm. Finally mRNA/cDNA hybrids were rinsed twice with 
70 percent ethanol and resuspended in 50 u.L of water. The cap was then biotinylated in a final volume of 210 uL by 
adding 5 \iL of 1 M sodium acetate (pH 6.1), 5 u,L of 10 percent SOS, and 150 uL of 10 mM biotin hydrazide long-arm 
(Vector Biosystem). 

5 [0104] Foliowing overnight (10 to 16 hours) incubation at room temperature (22 to 26°C), the mRNA/cDNA hybrids 
were precipitated by adding 75 jiL of 1M sodium acetate (pH 6.1 ) : 5 jjlL of 5M NaCI, and 750 u,L of absolute ethanol 
and incubated on ice for 1 hour or at -20 to -80°C for 30 min. The mRNA/cDNA hybrids were pelleted by centrifugation 
at 15,000 rpm for 10 min and the pellet was washed once with 70 percent ethanol and once with 80 percent ethanol. 
The mRNA/cDNA hybrids were resuspended in 70 p.L of 0.1 X TE (1 mM Tris [pH 7.5), 0.1 mM EDTA). 

w 

Capture and Release of Full-Length cDNA 

[0105] A 500 ul. quantity of MPG-streptavidin beads and 100 u,g of DNA-free tRNA were combined and the mixture 
was incubated on ice for 30 min with occasional stirring. The beads were separated using a magnetic stand for 3 
15 minutes and the supernatant was removed. The beads were then washed three times with 500 jiL of washing/binding 
solution (2 M NaCI, 50 mM EDTA, pH 8.0). 

[0106] At the same time, 1 unit of RNase I (Promega) was added per u,g of starting mRNA to the mRNA/cDNA hybrid 
sample in buffer supplied by the manufacturer (final volume 200 jiL) and the sample was incubated at 37°C for 1 5 min. 
To stop the reaction, the sample was placed on ice and 100 u.g of tRNA and 100 u.Lof 5M NaCI were added. To capture 

20 full-*coding/length mRNA/cDNA hybrids, the biotinylated, RNase l-treated mRNA/cDNA and the washed beads were 
combined and resuspended in 400 of the washing/binding solution. After mixing, the tube was gently rotated for 30 
min at room temperature. The full-coding/length cDNA remained on the beads, and the shortened cDNAs did not. The 
beads were separated from the supernatant on a magnetic stirrer. The beads were gently washed to remove nonspe- 
ciflcally adsorbed cDNA: two washes with washing/binding solution; one with 0.4 percent SDS and 50 u,g/mL tRNA;* 

25 one with 1 0 mM Tris-HCI (pH 7.5), 0.2 mM EDTA, 40 u,g/mL tRNA, 10 mM NaCI, and 20 percent glycerol; and one with 
50 ng/mL tRNA in water. 

[01071 T n © cDNA was released from the beads by adding 50 u,L of 50 mM NaOH and 5 mM EDTA and incubating 
for 1 0 min at room temperature with occasional mixing The beads then were removed magnetically, and the extracted 
cDNA was transferred on ice to a tube containing 50 u.L of 1 M Tris-HCI, pH 7.5. The dissolution cycle was repeated 

30 once or twice with 50 u,L aliquots of 50 mM NaOH and 5 mM EDTA until most of the cDNA (80 to 90 percent, as 
measured by monitoring the radioactivity with a hand-held monitor) had been recovered from the beads. 
[0108] To remove traces of RNA that could later interfere with the biotinylated RNA driver, 100 \iL of 1 M Tris-HCI, 
pH 7.0, and 1 uJ- of RNase I (10 U/ui.) were quickly added to the recovered cDN A on ice; the sample then was incubated 
at 37°C for 10 min. The cDNA was treated with proteinase K, phenol/chloroform-extracted, and back- extracted. Two 

35 to three p.g of glycogen were then added and the sample precipitated from ethanol in a siliconized tube. Alternatively 
the sample was concentrated by one round of ultrafiltration with a Microcon 1 00 (Millipore) for 40-60 min at 2,000 rpm. 
When precipitated from ethanol, the cDNA could be redissolved in 20 jtL of 0.1 X TE. In this experiment, RNase H 
digestion was not conducted. However, hydrolysis was conducted with NaOH, which is capable of simultaneously 
hydrolyzing and denaturing double strands. 

40 

CL-4B Spin-Column Filtration of cDNA 

[0109] The cDNA samples were treated by CL-4B chromatography (Caminci and Hayashizaki, 1999) or on an S- 
400 spin column (Amersham-Pharmacia) essentially as described by the manufacturer. 

45 

Ollgo-dG Tailing of First-Strand cDNA 

[01 10] The cDNA sample, 5 u,L of OX TdT buffer (2 M potassium cacodylate [pH 7.2], 1 0 mM MgCI 2 , 1 0 mM 2-mer- 
captoethanol), 5 u,L of 50 u,M dGTP, 5 pi. of 1 0 mM C0CI2, and 40 U terminal deoxynucleotidyl transferase were admixed 
50 to a final volume of 50 u,L. Samples were incubated at 37°C for 30 min. At the end, the reaction was stopped with EDTA 
20 mM. The cDNA was treated with proteinase K, extracted with phenol chloroform, and precipitated in ethanol. The 
sample was finally redissolved in TE (10 mM Tris pH 7.5-8.0, EDTA 1 mM). After the tail length had been checked as 
described (Caminci and Hayashizaki, 1999), the cDNA was employed in second-strand synthesis for use in verifying 
libraries (see below) or employed as cDNA tester for use innormalization and/or subtraction. 

55 

Normalization Drivers 

[0111] mRNA drivers comprising aliquots of starting mRNA are called "normalizing or normalization drivers." To 
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calculate the concentration of the normalizing driver, the ribosorn a l/structural RNA contamination in the starting mRNA 
was approximated by assuming that the incorporation rate of the first-strand synthesis reflected the actual mRNA 
concentration, thus assuming 1 00 percent efficiency of priming and elongation. Assuming that the proportion of mRNA 
converted to first-strand cDNA corresponded to the actual mRNA concentration, less-than-full-length cDNAs were 
5 excluded from consideration. Although not all of the mRNA is normally primed, a slight excess of normalization driver 
seldom has as dramatic an effect as a paucity of driver. Therefore, it was assumed that the amount of mRNA in the 
sample was the same as the quantity of first-strand cDNA produced. 

Subtraction Drivers 

10 

[0112] The subtractive drivers consisted of bulk run-off transcripts prepared from cloned minilibraries and rearrayed 
libraries prepared from the nonredundant RIKEN cDNA encyclopedia using T7 and T3 RNA polymerases. 
[0113] The minilibraries contained approximately 1 ,000 to 2,000 clones of cDNA in the sample that were derived 
from previous normalization experiments conducted by the same methods as the experiment described in the present 

is embodiment. Employing the standard protocol, minilibraries were prepared from the captured aliquot (abundant cDNA 
fraction) that was the by-product of normalization experiments. Following normalization, the abundant cDNA fraction 
was removed from the beads with 50 mM NaOH/5 mM EDTA. Following neutralization, second-strand cDNA was 
prepared. Cloning was accomplished in a manner analogous to what has been previously described (Carninci and 
Hayashizaki, 1 999). Plasmid was then bulk-excised, and 1 ,000-2,000 clones per minilibrary were amplified on agarose/ 

20 ampicillin. To prepare drivers, 20,000 to 50,000 colonies were plated (plate size 150 mm diameter) on SOB-agarose/ 
ampicillin (Sambrook et at. 1 989, "Molecular Cloning: A laboratory Manual- Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY) and the plates were incubated overnight at 37°C. Bacterial cells were scraped from the plates in 
the presence of resuspension solution (Wizard DNA extraction kit; Promega) after which the manufacturer's protocol 
was followed. 

25 

Preparation of Nonredundant cDNA Library Driver 

[0114] Single clones from full-length cDNAs obtained in previous experiments conducted in the same manner as 
described in the present embodiment were rearrayed for subtraction. Rearrayed cDNAs were coated from 384-well 
30 plates onto SOD agarose/ampicillin plates. Plasmid extraction, DNA cleavage, and RNA preparation were conducted 
in the same manner as for the minilibraries. 

[0115] When the library had been cloned at the Ssfl site, the extracted plasmid was treated at multiple cloning sites 
on the 3'-end with Ssfl. (In the case of mRNA extracted from the liver and lungs, the minilibrary was cloned with Xho\ 
at the 3' -end site, and Pvu\ was used.) RNA was synthesized using either T3 or T7 RNA polymerase (Life Technologies), 
35 depending on the map of the construct used to prepare the driver, to prepare sense run-off RNAs. T3 polymerase was 
employed for Pvul-cleaved minilibraries and T7 polymerase for Ssfl-cleaved minilibraries. RNA was prepared using 
RNA polymerases (Life Technologies) in accordance with the manufacturer's instructions. Extensive digestion with 1 
to 2 uL of DNasel (RQ1, RNase-free, Promega) was performed for 30 min. Proteinase K digestion was then conducted, 
followed by extraction with phenol/chloroform and chloroform, and cDNA was precipitated. 

40 

Biotin Labeling of Normalizing/Subtracting RNA Drivers 

[01 16] To further clean up RNA drivers before labelling, the RNeasy kit (QIAGEN) was employed in accordance with 
the manufacturer's instructions. Subsequently, the Mirus nucleic acid biotinylation kit (Panvera) was employed in es- 
45 sentially the manner described by the manufacturer. A 1 0 u.g quantity of the RNA mix was labeled by combination with 
1 0 u.L of Label IT reagent and 1 0 u,L of labeling buffer A to a final volume of 1 00 \iL following the kit protocol instructions. 
The reaction was incubated at 37°C for 1 hour, after which the biotinylated RNA was precipitated by adding 1 /20-volume 
of SM NaCI and two volumes of 99 percent ethanol. After standard ethanol precipitation, the pellet was washed once 
with 80 percent ethanol, resuspended in 20 jxL of "1X Mirus labelling buffer A", and stored at -80°C until use (alterna- 
te tively, mRNA can be labelled using the psoralen-biotinylation kit (Ambion) in accordance with the manufacturer's in- 
structions). 

Normalization/Subtraction 

55 [01 1 7] The RNA drivers and cDNA were deproteinated using proteinase K followed by phenol/chloroform extraction, 
chloroform extraction, and ethanol precipitation. Oligo-dG-tailed cDNA was used as a substrate, which was mixed with 
the RNA drivers and blocking oligonucleotides (biotin-dG 5 to -dG^, here biotin-dG 16 was used) to hybridize to the C- 
stretch present in the subtracting driver and with oligo-dT primer to block the polyA sequences. However, any oligo- 
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nucleotide able to block common sequences between drivers and cDNAs can.be used. 

[0118] Hybridization was typically carried out at RoT values of 1 to 500 (RoT is defined in 1997 examples by Sag- 
erstrom et al.) in a buffer containing 80 percent formamide (from a deionizedstock), 250 mM NaCI, 25 mM HEPES 
(pH 7.5), and 5 mM EDTA.. Hybridization was carried out at 42°C in a dry oven; even volumes as small as 5 u,L did 
s not require mineral-oil overlays. After hybridization, the sample was precipitated by adding 2.5 volumes of absolute 
ethanol and incubated for 30 minutes on ice. The sample was then centrifuged for 10 min at 15,000 rpm and washed 
once with 70 percent ethanol; the cDNA (both single-strand cDNA and mRN A/single-strand cDNA hybrids) was carefully 
resuspended in 10 uA. of water on ice. 

10 Treatment with RNase I 

[01 1 9] As necessary, the tester/driver hybrids obtained in the above step can be treated with RNase I to remove the 
mRN A normalization and subtraction drivers bound nonspecifically to the tester cDNA. 

[01 20] After removing the supernatant from the sample, precipitated after hybridization as described above, the pellet 
'5 was resuspended with 45 u.L of double-distilled water or TE 0.1 X (1 mM Tris, 0.1 mM EDTA, pH 7.5) on ice (to minimize 
nonspecific repeat annealing). The pellet was completely redissolved before proceeding to the next step. 
[0121] A 5 u,L quantity of 1 0 x RNase I Buffer (Promega) and 0.5 unit of RNase I were then added per 1 0 [xg of driver 
RNA. 

[0122] The mixture was incubated at 37°C for 10 min, heated at 65°C for 10 min, and placed on ice. (As necessary, 
20 the samples can be treated with proteinase K, phenol/chloroform, and chloroform, and precipitated with ethanol before 
proceeding to the next step). . 

Removal of Hybrid 

25 [0123] The next step can be applied to the normalized/subtracted mixture whether or not it has been treated with 
RNase I as indicated in the above step. 

[01 24] Separately, 50 p.L of MPG-strepavidin magnetic beads (CPG Inc.) were prepared for each 1 ixg of biotinytated 
driver RNA; a 5 u.L quantity of beads was found to be capable of binding more than 400 ng of biotinylated driver. To 
each 50 u.l of beads was added 1 0 u.g of tRNA as a blocking reagent and the beads were incubated at room temperature 
30 for 10 to 20 min or on ice for 30 to 60 min with occasional shaking. A magnetic stand was employed to remove the 
beads, which were washed three times with a large excess of 1 M NaCI and 1 0 mM EDTA and resuspended in a volume 
of 1 M NaCI and 10 mM EDTA equivalent to the original volume of the bead suspension. 

[01 25] The blocked beads were mixed with the redissolved tester/driver mixture and the entire sample was incubated 
at room temperature for 15 min with occasional gentle mixing. After removing the beads using a magnetic stand for 3 
35 min, the supernatant, which contained the single-strand normalized/subtracted cDNA, was recovered. The beads were 
washed once with an excess volume of binding buffer (1 M NaCI, 1 0 mM EDTA) to recover any remaining ssDNA. The 
radioactivity of the labeled samples was measured before and after the procedure in order to estimate the yield of 
normalization/subtraction. 

[0126] Microcon 100 ultrafiltration was employed as described by the manufacturer (Millipore) to concentrate the 
40 cDNA solution to approximately 50 jiL. Subsequently, the cDNA was pelleted by the standard isopropanol procedure. 
The pellet was resuspended in 44 uJ of 0.1 x TE, to which 5 u.L of RNase I buffer and 1 U of RNase I were added to 
make a volume of 50 j±L. The samples were then incubated for 20 min at 37°C, after which 400 \iL of 0.2 percent SDS 
was added to inactivate the RNase I. Traces of degraded RNAs, blocking oligonucleotide, SDS, and buffer were re- 
moved by ultrafiltration with a Microcon 1 00 filter at 2,000 rpm and 25°C until the volumes were reduced to less than 
4 * 20 p.L. The samples were desalted by adding 400 uJ of 0.1 X TE then centrifuging as above for a total of three washes. 
The cDNA was recovered by Inverting the filter in a new tube and centrifuging at 9,000 rpm for 1 min. 

Synthesis of Second-Strand cDNA 

50 [01 27] The second-strand synthesis and cloning steps were identical for normalized/subtracted cDNA, the standard 
control libraries, andthe mini libraries. In the same man ner as fort he first-strand cDN A primer, a Xho\ -containing primer, 
5 , -(GA) 7 TTCTCGAGTTAATTAAATTAATC 1 3-3 , (SEQ ID NO:2), was prepared and purified by standard techniques. 
[0128] To prepare the second-strand reaction, oligo-dG-tailed cDN A was mixed with 6 uJ. of 1 00 ng/jiL second-strand 
primer adapter. 6 uJL of EX-Taq second-strand buffer (Takara), and 6 \lL of 2,5 mM (each) dNTPs. The reagents were 

5 ^ combined with enzyme at 50°C (usually from 45°C to about 80°C ) to ensure high specificity of priming (called "hot 
start"). Priming was then conducted by adding 3 uX of 5 U/|iL ExTaq polymerase (Takara) at 65°C in a thermocycler. 
After mixing, the annealing temperature was set by a negative ramp to 45°C for the Xhck primer (35°C for the Ssft 
primer in case of liver and lung libraries of Example 2). After 10 minutes at the annealing temperature, the second- 
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strand cDNA was extended during incubation at 68°C for 20 min. The annealing-extension cycle was repeated once 
more, followed by a final elongation step at 72°C for 1 0 min. At the beginning of the hot start, a 5 u.L aliquot was mixed 
with 0.5 u.L of [alpha 32 P]dGTP or [a!pha 32 P]dCTP and incorporated. The labeled aliquot was employed at the end of 
the reaction to measure the cDNA and to calculate the second-strand yield (Carninci and Hayashizaki, 1 999). 

5 

Cloning of cDNA 

[0129] Second-strand cDNA was treated with proteinase K, extracted with phenol-chloroform and chloroform, and 
ethanol-precipitated in accordance with standard procedures. The cDNA was then cleaved using 25 U/p.g each of 
10 BamH\ and Xho\ (or Sstl and Xhol for lung and liver libraries of Example 2). Following digestion, cDNA was treated 
with proteinase K, extracted with phenol-chloroform, and purified on a CL-4B spin column (Pharmacia). After ethanol 
precipitation, the cDNA was cloned essentially as described in the literature (Carninci and Hayashizaki, 1999). 

Methodology and Equipment Utilized 

[01 30] Plaque hybridization was conducted with random primer and labeled specific probes in accordance with stand- 
ard protocols (Sambrook et at. 1 989). 

[01 31] Alkali electrophoresis was performed as described in the literature (Sambrook et at. 1 989). All autoradiography 
signals were visually displayed using the Bas 2000 imaging system (Fuji). 
20 [0132] Bacteria were collected with commercially available picking machines (Q-bot and Q-pix; Genetics, UK) and 
transferred to 384-microwell plates. 

[0133] Duplicate plates were used to prepare plasmid DNA. The plasmid DNA from each of the 384-weII plates was 
divided and grown in four 96-deepwell plates. After overnight growth, plasmids were extracted either manually (Itoh et 
at. 1997, Nucleic Acids Res 25:1315-1316) or automatically (Itoh et al. 1999, Genome Res. 9:463-470). 
25 [0134] Sequences were typically run on a RISA sequencing unit (Shimadzu, JAPAN) or using the Perkin Elmer- 
Applied Biosystems ABI 377 in accordance with standard sequencing methodologies such as described by Hillier et 
al., 1996, Genome Research, 6:807-828. The sequencing primers were the M13 forward and reverse primers (above 
described SEQ ID NO: 5 and SEQ ID NO: 6). 

30 Example 2 

Lung and Liver Tissues 

[0135] cDNA normalized/subtracted libraries (and minilibraries) were prepared from lung and liver tissues in the 
35 same manner as described in Embodiment 1 for brain tissue, with the exception that a primer containing an Xho\ site 
(5 , (GA) 8 ACTCGAG(T) 16 VN-3 , ) (SEQ ID NO: 4) and an Ssfl-containing primer 
5 , -(GA) 9 GAGCTCACTAGTTTAATTAAATTAATC 1 1 -3' (SEQ ID NO: 3) were employed for mRNA extracted from liver 
and lung tissues. The other steps were the same as those described for brain tissue. 

40 Example 3 

Efficiency of Removal in Driver/Tester Capture 

Preparation of an RNA Template 

45 

[0136] A pBluescript plasmid containing a 5 Kb fragment of reeler cDNA (Hirotsune et aJ. t Nature Genetics, 1995, 
May, 1 0(:77-83)) was employed. 

[01 37] From 2.5 uL of template plasmid DNA (cleaved at the Notl restriction site), RNA was transcribed in vitro under 
standard conditions: 20 \lL of Gibco-BRL 5 X buffer, 5 u,L of rNTPs (10 mM each), 5 uL of 0.1 M DTT, and 20 Units of 
so T7 RNA polymerase at a final volume of 1 00 uJ.. The reaction was conducted by incubation at 37°C for 3 hours. Also, 
2 u.L of alpha- 32 P-rUTP was added in the reaction to label the RNA. 

[0138] Subsequently, 20 units of RQ1 DNase (Promega) were added to remove trace quantities of template DNA 
(plasmid) and the sample obtained was incubated at 37° C for 15 min. NaCI was added to the sample to a final con- 
centration of 250 mM. The sample obtained was deproteinated once with phenol (equilibrated with Tris)/chloroform 
55 and once with chloroform. Two volumes of ethanol RNA were then added to precipitate RNA. After centrifugation for 
20 min at 15,000 rpm, the precipitated RNA was separated from the supernatant. The precipitate was washed once 
with 70 percent ethanol, followed by centrifugation. The pellet was then redissolved in water 
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Preparation of cDN A (Tester) 

[0139] cDNA was prepared from the RNA template as specified in the instruction manual of Superscript II (Gibco 
BRL-Life Technology), with the exception that the primer that was specific to the clone was SK primer 
* ( 5'C GCTCTAG AACTAGTG G ATC3') (S EQ ID NO:7) and that alpha- 32 P dGTP was used to label the first strand for later 
tracing. Following phenol/chloroform extraction and ethanol precipitation in accordance with standard procedures, the 
cDNA was treated with alkali (50 mM NaOH for 30 min) to remove the hydrolized, hybridized RNA and neutralized with 
200 mM Tris at pH 7.00. A 20 U quantity of RNase I was added. In the end, the DNA was again extracted with phenol/ 
chloroform and precipitated with ethanol under standard conditions. 

10 

Biotinylation of RNA 

[0140] The RNA template was also employed as a driver. Aliquots of 500 ng of RNA were biotinylated with a biotin- 
psoralen kit (Ambion) on ice (samples 1-3) or at room temperature (samples 4-6) for 30 min (1 , 4), 45 min (2, 5) and 

15 60 min (3, 6) (see Fig. 6); 

[0141] Following biotinylation, 50 ng of cDNA (6,000 CPM) and 10 jig of tRNA were added to 150 ng of biotinylated 
driver (prepared under conditions 1-6, thus employing 6 tubes) (counting 21 ,000 CPM),. After standard phenol/chlo- 
roform extraction and ethanol precipitation, the samples were redissolved in 5 u,L of hybridization buffer (80 percent 
formamide, 250 mM NaCI, 25 Mm Hepes pH 7.5, 5 mM EDTA) and incubated at 42*C overnight (14 hours). 

20 [0142] Following ethanol precipitation (conducted in the same manner in the other embodiments), the samples (6 
tubes) were then mixed with streptavid in/magnetic [beads] (the subtraction step described In Embodiment 1). The 
supernatant (unbound) was then precipitated with ethanol under standard conditions with the addition of 4 ng of gly- 
cogen to ensure quantitative precipitation, and after resuspension, loaded on a standard RNA/formaldehyde minigel 
(lanes 1 -6). After 1 hour of electrophoresis at 60 V, the gel was dried and exposed with the Bas 2000 image analyzer 

25 (Puji). This showed the efficiency of removal of driver and tester. The (lanes 7^9) side was employed for untreated 
sample (mRNA/cDNA) corresponding to 10 percent and 2 percent of the starting count of 100. The intensity of the 
signal indicated the efficiency of removal of the driver/tester mixture. 

Example 4 

30 

Methods of Evaluating the Invention 
Reduction in the Frequency of Abundant cDNAs 

35 [0143] Several normalized/subtracted cDNA libraries were prepared from pancreas tissue in the same manner as 
described above for brain tissue using RNA drivers and minilibraries derived from the rearrayed nonredundant cDNAs 
prepared in the above-described embodiments to reduce unnecessary resequencing of clones already present. 
[0144] The second-strand cDNA from a standard pancreas cDNA library (without normalization/subtraction) was 
compared to its normalized/subtracted counterpart (Fig. 3). The normalized/subtracted cDNA was prepared in a single 

4 o normalization/subtraction step. Normalization was performed at RoT =10, and subtraction was conducted using a set 
of minilibraries prepared as set forth above at RoT = 20 (subtraction can be performed at an RoT value of up to around 
500) each of which contained 1,000 to 2,000 redundant mostly abundant clones from liver, lung, brain, or placenta 
tissue. The minilibraries were generated by cloning the highly expressed fractions of previously prepared, normalized 
cDNA libraries. Amplified cDNA minilibraries were then used to prepare the subtracting drivers (as described above). 

4S The RoT of the subtracting drivers equaled 1 unit for every 200 clones (e.g., RoT =5 when 1,000 clones were employed). 
The average length of normalized, subtracted cDNA was greater than that of non-normalized, non-subtracted cDNA, 
suggesting that long cDNAs (which migrate more slowly) are expressed more rarely than the shortest cDNAs. In ad- 
dition, bands corresponding to cDNAs of highly expressed mRNAs were not visible from the normaltzed-subtracted 
library. 

50 [0145] Figure 3 shows an electrophoresis run without normalization/subtraction (standard cDNA). A few highly in- 
tense bands derived from superabundant RNAs are visible. By contrast, in the normalized-subtracted cDNA, those 
bands are no longer visible, suggesting a decrease in the cDNA. Moreover, in the normalized/subtracted cDNA, the 
relative intensity of cDNAs corresponding to long mRNAs (>-3 Kb) is greater than in the standard libraries. 
[0146] Another way of demonstrating the advantages of normalization/subtraction is shown in Fig. 4. 

5 $ [01 47] First-strand lung cDN A prepared as described above was employed as template. Genes that were abundantly 
expressed in plaque hybridization functionally corresponding to a normalized cDNA library decreased in the normalized 
library. When 10,000 plaques of the normalized lung library were screened, elongation factor 1 -alpha went from 90 
plaques in the reference library to 10 in the normalized library. Carbonyl reductase decreased from about 70 plaques 
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to 3. And uteroglobin was reduced from about 510 plaques to 2. The plaques were counted. There were more plaques 
in the standard cDNA libraries (about 1 0 times more in the standard library than in the normalized library). These results 
demonstrated that the frequency of abundantly expressed cDNAs in the normalized library was much lower than in 
the control. 

5 

Example 5 

Increasing the Frequency of Discovery of Rare Genes 

io [0148] Large-scale library sequencing is the best way to test the concentration of rarecDNAs. A number of libraries 
(Table 1) were prepared from several mouse tissues by the method set forth above and assessed by checking the 
average size of the cDNA inserts (insert size), sequences passes (Seq.), clusters (Sp.) f redundancy (Red.), appearance 
of new clones (Unique), and presence of full-coding/length cDNA based on the percent of sequences having the first 
ATG codon (coding percent). 



[Table 1] 





Library 
ID 


Development 
Stage /tissue 


Normalizing 
Driver (Rot) 


Insertion 
Size 

(KDp) 


Seq. 


Sp. 


Red. 


Unique (%) 


Coding 
(%) 


20 


18-100 


Adult/pancreas 


mRNA(5) 


1.2 


13556 


3402 


3.98 


442 


(13.0) 


(100.0) 




22-000 


adult/stomach 


(standard) 


0.88 


1458 


488 


2.99 


42 


(8.6) 


(82.1) 




uu 






1 .21 


4400 


1932 


2.28 


196 


(10.1) 


(82. t) 


25 


22-104 




mRNA(5) 


1.13 


3639 


1852 


2.11 


207 


(11.1) 


(82.1) 




23-000 


Adult/tongue 


(standard) 




1179 


556 


2.12 


36 


(6.5) 


76.8 




23-100 




mRNA(5) 


1.44 


10267 


4017 


2.56 


586 




76.8 


30 


24-100 


ES cell 


mRNA(5) 


1.77 


15226 


4495 


3.89 


485 


(10.8) 


(88.6) ■- 




25-100 


Embryo13/!iver 


mRNA(5) 


1.19 


5448 


1525 


3.57 


168 


(11.0) 


92.2 




26-000 


Embryo 10/ 


(standard) 


1.38 


2108 


1061 


1.99 


71 


(6.7) 


92.3 




26-100 


whole body 


mRNA(7.5) 


1.32 


11267 


4722 


2.99 


582 


(12.3) 


92.3 


35 


28-100 


Embryo 10+11/ 


mRNA(7.5) 


1.29 


6248 


3411 


1.83 


271 


(7.9) 


(93.9) 




28-104 


whole body 


mRNA(7.5) 


1.38 


9321 


4335 


2.15 


453 


(10.4) 


(93.9) 




31-000 


Embryo/ head 


(standard) 


1.22 


488 


369 


1.32 


23 


(6.2) 


(86.2) 


40 


31-100 




mRNA(10) 


1.55 


7838 


4229 


1.85 


494 


(11.7) 


(86.2) 


32-304 


Embryo 
14+17/head 


mRIMA(1 0) 


2.5 


424 


389 


1.09 


20 


(5.1) 


(88.2) 


45. 


38-304 


Embryo 11/ 
placenta & 
extraembryon 
tissue 


mRNA(10) 


1.45 


3657 


2165 


1.69 


156 


(7.2) 


(100.0) 




39-304 


Embryo 13/ 
whole body 


mRNA(10) 


2.47 


348 


319 


1.09 


22 


(6.9) 


(90.0) 


50 


49-304 


Adult/testis 


mRNA(1 0) 


2.11 


8900 


5444 


1.63 


1214 


(22.3) 


(95.7) 




52-304 


Adult/ Xiphoid 


total RNA(3) 


2.69 


272 


256 


1.09 


15 


(5.9) 


(100.0) 




53-304 


Adult/pituitary 
gland 


total RNA(3) 


2.38 


8059 


4858 


1.73 


833 


(17.9) 


(100.0) 


55 


54-304 


Neonate6/ head 


mRNA(10) 


2.3 


2663 


2101 


1.27 


196 


(9.3) 


(90.0) 



16 



EP 1 197 552 A2 

[Table 1] (continued) 



LJuraiy 

ID 


ijeveioprneni 
Stage /tissue 


inoii ndH^iny 

Driver (Rot) 


Insertion 
Size 
(kbp) 


Seq. 




neu. 


Unique (%) 


ooaing 

(%) 


57-304 


Embryo8/ whole 
body 


(subtracted 
only) 


1.91 


19532 


7758 


2.53 


1155 


(14.9) 


(100.0) 


58-304 


Adult/thymus '* 


mRNA(TO) 


3.27 


10259 


6442 


1.59 


1100 


(17.1) 


(80.0) 


60-304 


Embryo13/ 
testis 


total RNA(5) 




11079 


6498 


1.7 


1243 


(19.1) 


(75.0) 


61-304 


Embryo 14/ 
thymus 


(subtracted 
only) 


4.13 


206 


196 


1.05 


/; 16 


(8.2) 


(60.0) 


62-304 


Embryo 11 /head 


rriRNA(TO) 


2.19 


2957 


2374 


1.25 


256 


(10.8) 


(70.0) 



10 



15 



20 



25 



30 



35 



40 



45 



SO 
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[01 49] Assessing the degree of sequence redundancy was the final evaluation of the efficiency of the normalization/ 
subtraction process. Standard libraries (indicated by the reference numbers 22-000, 23-000, 26-000, and 31-000) 
prepared from an aliquot of the starting cDNA are shown for comparison (Table 1 ). 

[0150] In one successfully normalized-subtracted cDNA library (library 49-304 from mouse testicular tissue), the 
redundancy of 3*-end sequences was as low as 1.63 (calculated by dividing the total number of clones sequenced, 
8,900, by the number of different clusters, 5,444). It was anticipiated that redundancies of less than 2.0 in more than 
1 0,000 to 15,000 3'-end, sequences could be expected in successful cDNA libraries from complex tissues (e.g., testes, 
brain, and thymus). 

[0151] Normalized/subtracted cDNA libraries facilitated efficient and increased recovery of unknown genes. For ex- 
ample, libraries 22-1 00, 23-1 00, 26-1 00, and 31 -1 00 produced values of new data per sequencing reaction higher than 
standard library counterparts 22-000, 23-000, 26-000, and 31-000 (Fig. 5). 

[0152] Sequencing severalcDNAs from several libraries revealed a decrease in sequence redundancy in the nor- 
malized-subtracted libraries as compared to the standard cDNA libraries (Fig.5). \ 
[0153] In Figure 5, 1 00 percent new gene discovery corresponds to a redundancy value of 1 , 50 percent corresponds 
to a redundancy value of 2, 25 percent corresponds to a redundancy value of 4, and so forth. 

[0154] Normalization increases the frequency of new gene discovery relative to standard libraries during a given 
sequencing effort. 

Example 6 

Comparative Example of the Normalization-Subtraction Methods of the Present Invention 

[0155] The importance of the use of an enzyme cleaving single-strand RN A (driver) bound nonspecifically to cDNA 
(tester) was checked. Accordingly, sub-libraries prepared by the normalization/subtraction method of the present in- 
vention were compared with sub-libraries prepared by the normalization/subtraction method and the step of removing 
non-specific hybrid. 

[0156] Normalized/subtracted cDNAs prepared in accordance with the first part of Example 1 (through the portion 
including the normalization/subtraction step) were divided into two subtraction libraries. The first sublibrary was sub- 
jected to second strand cDNA synthesis and cloning (without removal of non-specifically bound hybrids), while the 
second library was subjected to the RNase I treatment (removal of non-specif icaily bound hybrids) described in Example 
1 . The mouse tissues prepared were as follows: medulla oblongata for library 63, olfactory brain for library 64, colon 
for library 90, and cecum for library 91 . The data are reported in Table 2. 

[Table 2] 





Reference Number 




Cluster 


Unique Clone 


p/o Unique Clone 




-RNase I 


63-304-R 


RISA 


2987 


252 


(8.4) 


)15.4% 


+RNase I 


63-305-R 


RISA 


5358 


518 


(9.7) 


-RNase I 


64-304-R 


RISA 


1258 


80 


(6.4) 


)81.1% 


+RNase I 


64-305- R 


RISA 


6371 


742 


(11.6) 
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[Table 2] (continued) 





Reference Number 




Cluster 


Unique Clone 


p/o Unique Clone 




-RNase I 


90-300-R 


RISA 


1348 


106 


(7-9) 


)27.0% 


+RNase I 


90-302-R 


RISA 


779 


72 


(9.2) 


-RNase I 


90-304-R 


RISA 


1066 


57 


(5.3) 


)62.0% 


+RNase I 


90-306-R 


RISA 


1479 


127 


(8.6) 


-RNase I 


91-300-R 


RISA 


1470 


95 


(6.5) 


)11.5% 


+RNase I 


91-302-R 


RISA 


1786 


134 


(7.5) 
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[0157] Sublibraries 63-304-R, 64-304-R, 90-300-R, 90-304-R, 91-300-R denote sublibraries that were not treated 
with RNase I. 

[0158] Sublibraries 63-305-R, 64-305-R, 90-302-R, 90-306-R, 91-302-R denote sublibraries treated with RNase !. 
[0159] The combinations of sublibraries belonging to the same library are: 63-304-R and 63-305-R; 64-304-R and 
64-305-R; 90-300-R and 90-302-R; 90-304-R and 90-306-R; and 91-300-R and 91-302-R. 

[01 60] The number of clusters for each sublibrary denotes the number of different clones (that is, clusters) comprising 
the sublibrary. Each cluster can comprise one or more clones having the same sequence (that is, the same sequence 
picked up a number of times). 

[0161] "Unique clones" indicate the number of new clones obtained from a sublibrary cluster that have not been 
previously sequenced. 

[0162] The "percentage of unique clones" (for example, the value "8.4" for the sublibrary 63-304-R) denotes the 
number of unique clones discovered (for example, u 252 u for sublibrary 63-304-R) divided by the number of clusters 
("2987" for sublibrary 63-304-R). 

[0163] The data of Table 2 indicate that all tests of treatment with RNase I yielded high percentages of unique clones 
(for example," 1 5.4 percent for sublibraries 63-304-R and 63-305-R). This indicates that a number of undiscovered 
(unique) clones nonspecifically bound to mRNA driver were released from hybrids by the RNase I treatment, recovered, 
and discovered. 

[0164] The clone sequences were run on a RISA sequencing unit (Shimadzu, JAPAN). M13 forward and reverse 
primers were employed as sequencing primers. Forward: M13 oligo (5* TGTAAAACGACGGCCAGT 3*) (SEQ ID NO: 
5); reverse: 1233REV oligo (5* AGCGGATAACAATTTCACACAGGA 3') (SEQ ID NO: 6). 

[0165] Sequencing was conducted in accordance with a known standard sequencing protocol (Hillier et al., 1996, 
Genome Res. : 6:807-828). 
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SEQUENCE LISTING 
<110> The Institute of Physical and Chemical Research (RIKEN), Hayashizaki, Yoshihide 
<120> Method for the preparation of normalized and/or subtracted cDNAs 
<130>FA1135H/1-12 
<160>7 

< 1 70> Patentln Ver. 2.1 

<210> 1 

<211>43 

<212>DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: first-strand 

primer comprising BamHI and SstI restriction sites 

<220> - 
<22 1 > misc_feature 
<222> (42) 

<223> Nucleotide 42 is v wherein v = g or c or a 
<220> 

<221> misc_feature 
<222> (43) 

<223> Nucleotide 43 is n wherein n = any nucleotide 
<400> 1 

gagagagaga aggatccaag agctcttttt tttttttttt tvn 43 

<210>2 

<211>49 

<212>DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
comprising the Xhol restriction site 

<400> 2 

gagagagaga gagattctcg agttaattaa attaatcccc ccccccccc 49 
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<2l0>3 
<2ll>55 
<212> DNA 

<2 1 3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
comprising the Sstl restriction site 

<400> 3 

gagagagaga gagagagaga gctcactagt ttaattaaat taatcccccc ccccc 

<210>4 

<211>41 

<212>DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
comprising the Xhol restriction site 

<220> 

<221> misc_feature 
<222> (40) 

<223> Nucleotide 40 is v wherein v - g or c or a 
<220> 

<22 1 > misc_feature 
<222> (41) 

<223> Nucleotide 41 is n wherein n = any nucleotide 
<400> 4 

gagagagaga gagagaactc gagttttttt tttttttttv n 41 

<210>5 
<2il>18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: M13 forward 
primer 

<400> 5 

tgtaaaacga cggccagt 1 8 
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<2l0>6 
<211>24 
<2 1 2> DNA 

<2 1 3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 1233REV 
reverse primer 

<400> 6 

agcggataac aatttcacac agga 24 

<210>7 
<211>20 
<212> DNA 

<2 1 3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: SK primer 
<400> 7 

cgctctagaa ctagtggatc 20 



30 



Claims 



1 . A method of preparing normalized and/or subtracted cDNAs characterized by comprising the steps of: 



40 



I) preparing uncloned cDNAs (testers); 

II) preparing polynucleotides (drivers) for normalization and/or subtraction; 

III) conducting normalization and/or subtraction and removing tester/driver hybrids and non-hybridized poly- 
nucleotide drivers; and 

IV) recovering the normalized and/or subtracted cDNA. 



2, The method of claim 1 , wherein the cDNA tester of step I) is reverse transcript of mRNAin form of uncloned cDNA. 
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3. The method of claim 1 wherein said cDNA tester is single strand. 



4. The method of any of claims 1-3 wherein in step III), normalization is conducted first, followed by subtraction. 



5. The method of any of claims 1-3 wherein in step III), subtraction is conducted first, followed by normalization. 
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6. The method of any of claims 1-3 wherein in step III), said tester and normalization and subtraction drivers are 
mixed together and normalization and subtraction are conducted as a single step. 
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7. The method of any of claims 1 -6 wherein said normalized and/or subtracted cDNA is long-strand, full-coding, and/ 
or full-length cDNA. 



8. The method of any of claims 1 -7 wherein step III) comprises the addition of an enzyme capable of cleaving single- 
strand RNA driver nonspecifically bound to single strand cDNA and the cleaved single strand RNA driver is re- 
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moved. 

9. The method of claim 8 wherein said enzyme is single-strand-specific RNA endonuclease. 

5 10. The method of claim 8 wherein said enzyme is either selected from the group consisting of RNase I, RNaseA, 
RNase4, RNaseTI , RNaseT2, RNase2, and RNase3, or comprises a mixture thereof. 

11. The method of claim 8 wherein said enzyme is RNase I. 

io 12. The method of any of claims 1-11 wherein said cDNA tester is prepared by CAP-trapping the 5* end of RNA. 

13. The method of any of claims 1-12 wherein the preparation of said cDNA tester comprises the following steps: 

(1) synthesizing first strand cDNA by means of reverse transcriptase forming mRNA/cDNA hybrids; 
15 (2) chemically binding a tag molecule to the dio! structure of the 5' CAP( 7Me G ppp N) site of mRNA forming 

hybrids;. 

(3) trapping long-strand, full-coding, and/or full-length cDNA hybrids; and 

(4) removing single strand mRNA through digestion with an enzyme capable of cleaving single strand mRNA. 

20 14. The method of claim 13 wherein said tag molecule is digoxigenin, biotin, avidin, or streptavidin. 

15. The method of any of claims 1-14, wherein said polynucleotide driver for normalization and/or subtraction is RNA 
and/or DNA. 

25 16. The method of claim 15, wherein said DNA driver is cDNA. . 

17. The method of any of claims 1 -1 6 wherein said normalization driver comprises cellular mRNA from the same library, 
the same tissue, or the same cDN A population as what is to be normalized. 

30 18. The method of any of claims 1-16 wherein said normalization driver comprises single strand cDNA obtained from 
the same library, the same tissue, or the same cDNA population as what is to be normalized. 

19. The method of any of claims 1 -1 6 wherein said subtraction driver comprises cellular mRNA from a library, tissue, 
or cDNA population differing from what is to be subtracted. 

35 

20. The method of any of claims 1-16 wherein said subtraction driver comprises single strand cDNA from a library, 
tissue, or cDNA population differing from what is to be normalized. 

. 21. The method of any of claims 1-20, further comprising a step V) of preparing a second strand of recovered cDNA 
40 and performing cloning. 

22. A method of preparing normalized and/or subtracted cDNAs characterized by comprising the steps of: 

I) preparing cDNAs (testers) not cloned in plasmid; 
45 ||) preparing polynucleotides (drivers) for normalization and/or subtraction; 

III) conducting normalization and/or subtraction and removing tester/driver hybrids and non-hybridized poly- 
nucleotide drivers; and 

IV) recovering the normalized and/or subtracted cDNA. 

50 23. The method of claim 22, wherein in step III), normalization is conducted first, followed by subtraction. 

24. The method of claim 22, wherein in step III), subtraction is conducted first, followed by normalization. 

25. The method of claim 22, wherein in step III), said tester and normalization and subtraction drivers are mixed together 
55 and normalization and subtraction are conducted as a single step. 

26. The method of any of claims 22-25, wherein said normalized and/or subtracted cDNA is long-strand, full-coding, 
and/or full-length cDNA. 
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27. The method of any of claims 22-26, wherein step III) comprises the addition of an enzyme capable of cleaving 
single-strand RNA driver nonspecifically bound to single strand cDNA and the cleaved single strand RNA driver 
is removed. 

28. - The method of claim 27, wherein said enzyme is single-strand-specific RNA endonuclease. 

29. The method of claim 27, wherein said enzyme is either selected from the group consisting of RNase I, RNaseA, 
RNase4, RNaseTI , RNaseT2, RNase2, and RNase3, or comprises a mixture thereof. 

30. The method of claim 27, wherein said enzyme is RNase I. 

31 . The method of any of claims 22-30, wherein said cDN A tester is prepared by CAP-trapping the 5* end of RNA. 

32. The method of any of claims 22-31, wherein said normalization driver comprises cellular mRNA from the same 
library, the same tissue, or the same cDNA population as what is to be normalized. 

33. The method of any of claims 22-31 , wherein said normalization driver comprises single strand cDNA obtained from 
the same library, the same tissue, or the same cDN A population as what is to be normalized. 

34. The method of any of claims 22-31 . wherein said subtraction driver comprises cellular mRNA from a library, tissue, 
or cDNA population differing from what is to be subtracted. 

35. The method of any of claims 22-31, wherein said subtraction driver comprises single strand cDNA from a library, 
tissue, or cDN A population differing from what is to be normalized. 

36. The method of any of claims 22-35, further comprising a step V) of preparing a second strand of recovered cDNA 
and performing cloning. 

37. A method of preparing normalized and subtracted cDNA comprising the steps of: 

I) preparing cDNAs (tester); 

II) preparing polynucleotides (drivers) for normalization and subtraction; 

III) conducting the normalization and subtraction as a single step by mixing together the tester and the drivers; 
and 

IV) recovering the normalized and subtracted cDNA. 

38. The method of claim 37; wherein the cDNA tester is cloned or uncloned cDNA. 

39. The method of claim 37, wherein cDNA tester is reverse transcript of mRNA in form of uncloned cDNA. 

40. The method of claim 37, wherein the cDNA tester is single strand. 

41. The method of any of claims 37-40 : wherein said normalized and subtracted cDNA is long-strand, full-coding, and/ 
or full-length cDNA. 

42. The method of any of claims 37-41, wherein step III) comprises the addition of an enzyme capable of cleaving 
single-strand RNA driver nonspecifically bound to single strand cDNA and the cleaved single strand RNA driver 
is removed. 

43. The method of claim 42, wherein said enzyme is single-strand-specific RNA endonuclease. 

44. The method of claim 42, wherein said enzyme is either selected from the group consisting of RNase I, RNaseA, 
RNase4, RNaseTI, RNaseT2, RNase2, and RNase3, or comprises a mixture thereof. 

45. The method of claim 42, wherein said enzyme is RNase I. 

46. The method of any of claims 37-45, wherein said cDNA tester is prepared by CAP-trapping S* end of RNA. 
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47. The method of any of claims 37-46, wherein the preparation of said cDNA tester comprises the following steps: 

(1) synthesizing first strand cDNA by means of reverse transcriptase forming mRNA/cDNA hybrids; 

(2) chemically binding a tag molecule to the diol structure of the 5' CAP( 7Me Gpp p N) site of mRNA forming 
hybrids; 

(3) trapping long-strand, full-coding, and/or full-length cDNA hybrids; and 

(4) removing single strand mRNA through digestion with an enzyme capable of cleaving single strand mRNA. 

48. The method of claim 47, wherein said tag molecule is digoxigenin, biotin, avidin, or streptavidin. 

49. The method of any of claims 37-48. wherein said polynucleotide driver for normalization and/or subtraction is RNA 
and/or DNA. 

50. The method of claim 49, wherein said DNA driver is cDNA. 

51. The method of any of claims 37-50, wherein said normalization driver comprises cellular mRNA from the same 
library, the same tissue, or the same cDNA population as what is to be normalized. 

52. The method of any of claims 37-50, wherein said normalization driver comprises single strand cDNA obtained from 
20 the same library, the same tissue, or the same cDNA population as what is to be normalized. 

53. The method of any of claims 37-50 : wherein said subtraction driver comprises cellular mRNA from a library, tissue, 
or cDNA population differing from what is to be subtracted. 

25 54. The method of any of claims 37-50, wherein said subtraction driver comprises single strand cDNA from a library, 
tissue, or cDNA population differing from what is to be normalized. 

55. The method of any of claims 37-54, further comprising a step V) of preparing a second strand of recovered cDNA 
and performing cloning. 
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56. A method of preparing normalized and/or subtracted cDNA comprising the steps of: 



(a) preparing cDNA (tester); 

(b) preparing normalization and/or subtraction RNA (driver); 

35 ( C ) conducting normalization and/or subtraction in two steps in any order, or conducting normalization/sub- 

traction as a single step and mixing the normalization/subtraction RNA driver with said cDNA tester; 

(d) adding an enzyme capable of cleaving single strand sites on RNA drivers non-specifically bound to cDNA 
tester; 

(e) removing said single strand RNA driver cleaved in step d) from the tester and removing tester/driver hybrids; 
40 and 

(f) recovering the normalized and/or subtracted cDNA. 

57. The method of claim 56, wherein the cDNA tester is cloned or uncloned cDNA. 
•*5 58. The method of claim 56, wherein the cDNA tester is reverse transcript of mRNA in form of uncloned cDNA. 

59. The method of claim 56, wherein said cDNA tester is single strand 

60. The method of any of claims 56-59, wherein in step c), normalization is conducted first, followed by subtraction. 

so 

61. The method of any of clams 56-59, wherein in step c), subtraction is conducted first, followed by normalization. 

62. The method of any of claims 56-59, wherein in step c), said tester and normalization and subtraction drivers are 
mixed together and normalization and subtraction are conducted as a single step. 

55 

63. The method of any of claims 56-62, wherein said normalized and/or subtracted cDNA is long-strand, full-coding, 
and/or full-length cDNA. 
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64. The method of any of claims 56-65, wherein the enzyme of said step d) is either selected from the group consisting 
of RNase I, RNaseA, RNase4, RNaseTI , RNaseT2, RNase2, and RNase3, or comprises a mixture thereof. ' 

65. The method of any of claims 56-63, wherein the enzyme of said step d) is RNase I. . 

5 

66. The method of any of claims 56-65, wherein said cDNA tester is prepared by CAP-trapping the 5' end of RNA. 

67. The method of any of claims 56-66, further comprising the step g) of preparing a second strand of recovered cDNA 
and performing cloning. 

10 

68. The method of any of claims 1-67, wherein said tester/driver hybrids are bound to tag molecules. 

69. The method of claim 68, wherein said tag molecule is avidin, streptavidin, biotin, digoxigenin, an antibody, or an 
antigen. 

is 

70. The method of any of claims 1-69, wherein said tester/driver hybrids are removed through the use of a matrix. 

71. The method of claim 70, wherein said matrix is comprised of magnetic beads or agarose beads. 

20 72. The method of claim 71 , wherein said magnetic beads or agarose beads are covered by or bound to any tag 
molecule capable of binding to tag molecules bound to a tester/driver hybrids. 

73. The method of claim 71 , wherein said magnetic beads or agarose beads are covered by or bound to a tag molecule 
capable of binding to avidin, streptavidin, biotin, digoxigenin, an antibody, or an antigen bound to a tester/driver 

2S hybrid. 

74. The method of claim 72 or 73, wherein said antibody covering said beads or said antibody binding said beads is 
an antiantigen antibody, antibiotin antibody, antiavidin antibody, antistreptavidin antibody, or antidigoxigenin anti- 
body. 

30 

75. The method of any of claims 1-74. wherein said tester/driver hybrid is removed through the use of streptavidin/ 
phenol. 

76. The method of any of claims 1-75, wherein hydroxy apadite and nonlabeled RNA are employed to remove said 
35 tester/driver hybrid. 

77. A method of removing RNA nonspecifically bound to DNA by processing nonspecifically bound RNA/DNA hybrids 
with an enzyme capable of degrading single strand RNA. 

4 ° 78. The method of claim 77, wherein said enzyme is either selected from the group consisting of RNase I, RNaseA, 
RNase4, RNaseTI, RNaseT2, RNase2, and RNase3, or comprises a mixture thereof . 

79. The method of claim 77, wherein said enzyme is RNase I. 

45 80. The method of any of claims 77-79, wherein said RNA/DNA hybrid is a product of normalization. 

81. The method of any of claims 77-79, wherein said RNA/DNA hybrid is a product of subtraction. 

82. The method of any of claims 77-79. wherein said RNA/DNA hybrid is the product of a method comprising the steps 
50 of normalization and subtraction in any order or of a method comprising a single normalization/subtraction step. 

83. A method of isolating single strand cDN A comprising the steps of treating a hybrid comprising RNA nonspecifically 
bound to cDNA with an enzyme capable of degrading single strand RNA, removing the degraded single strand 
RNA, and recovering the cDNA. 

55 

84. A method of preparing normalized and/or subtracted cDNA comprising the steps of adding an enzyme capable of 
degrading single strand RNA driver nonspecifically bound to cDNA tester, and removing the degraded single strand 
RNA driver. 



25 



EP 1 197 552 A2 

85. The method of any of claims 77-84, wherein said DNA or cDNA is long-chain t full-coding, and/or full-length cDNA. 

86. The method of any of claims 1-85 employed to prepare one, two, or more libraries. 
5 87. cDN A or a cDNA library obtainable by any of the methods of claims 1 -86. 

88. The cDNA of claim 87 that is single strand or double strand. 

10 
15 
20 
25 
30 

35 

40 

45 

50 



NSOOCIO <EP 1197552A2 I > 



26 




MSOOC1D <EP T197552A2 I > 



28 



EP 1 197 552 A2 



Without Normalization 
/Subtraction 



A/Hind III 



With Normalization 
/Subtraction 



% /Hind III! 



9.4kbp 
6.6kbp 

4.4kbp 



2.2kbp 
2.0kbp 



m 



G.5kbp 



9.4kbp 
6.6kbp 

4.4kbp 



2.2kbp 
2.0kbp 



O.Skbp 



: Highly expressed genes 



Fig. 3 



29 




Standard Normalized 



Fig.4 



30 



EP1 197 552 A2 




(%) AjaAOOSjp ouq3 M9ft 



31 



EP 1 197 552 A2 




Fig.6 



iSDOCID <EP 1197552A2 I > 



32 



(19) 



J 



EuropaischesPatentamt 
European Patent Office 
Office europeen des brevets 



(12) 



(ID EP 1 197 552 A3 

EUROPEAN PATENT APPLICATION 



(88) Date of publication A3: 

30.07.2003 Bulletin 2003/31 

(43) Date of publication A2: 

17.04.2002 Bulletin 2002/16 

(21) Application number: 01120108.4 

(22) Date of filing: 22.08.2001 



(51) lntC!7: C12N 15/10, C12Q 1/68 



(84) Designated Contracting States: 


• Hayashizaki, Yoshihide 


AT BE CH CY DE DK ES Fl FR GB GR IE IT LI LU 


Tsukuba-shi, Ibaraki 305-0061 (JP) 


MC NL PT SETR 


Designated Extension States: 


(72) Inventor: Hayashizaki, Joshihide 


AL LT LV NIK RO SI 


Tukuba-shi, Ibaraki 305-0061 (JP) 


(30) Priority: 25.08.2000 JP 2000255402 


(74) Representative: Gervasi, Gemma, Dr. 




Notarbartolo & Gervasi S.p.A., 


(71) Applicants: 


Corso di Porta Vittoria, 9 


• Riken 


20122 Milano (IT) 


Wako-shi, Saitama 351-0198 (JP) 





(54) Method of preparing normalized and/or subtracted cDNA 



(57) A method of preparing normalized and/or sub- 
tracted cDNA; a method in which the cDNA that is nor- 
malized and/or subtracted is in the form of uncloned cD- 
NA (cDNA tester); a method of preparing normalized 
and/or subtracted cDNA comprising the steps of: (a) 

preparing cDNA tester; (b) preparing normaliza- 
tion and/or subtraction RNA driver; (c) conducting nor- 
malization and/or subtraction in two steps in any order, 
or conducting normalization/subtraction as a single step 
and mixing the normalization/subtraction RNA driver 



with said cDNA tester; (d) adding an enzyme capable of 
cleaving single strand sites on RNA drivers nonspecific 
cally bound to cDNA tester; (e) removing said single 
strand RNA driver cleaved in step d) from the tester and 
removing tester/driver hybrids; and (f) recovering the 
normalized and/or subtracted cDNA; and a method of 
efficiently preparing normalized and/or subtracted long- 
chain, full-coding, and full-length cDNA libraries are pro- 
vided. 



CO 
< 
CM 

in 



Q. 

LU 



Printed by Jouvc. 75001 PARIS (PR) 



EP 1 197 552 A3 



European Patent 
Office 



EUROPEAN SEARCH REPORT 



Application Number 

EP 01 12 0108 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation of document with indication, where appropriate, 
of relevant passages 



Relevant 
to claim 



CLASSIFICATION OF THE 
APPLICATION (tnLCI.7) 



P,X 



CARNINCI P ET AL: "NORMALIZATION AND 
SUBTRACTION OF CAP -TRAPPER-SELECTED CDNAS 
TO PREPARE FULL-LENGTH CDNA LIBRARIES FOR 
RAPID DISCOVERY OF NEW GENES" 
GENOME RESEARCH, COLD SPRING HARBOR 
LABORATORY PRESS, US, 

vol. 10, no. 10, October 2000 (2000-10), 
pages 1617-1630, XP002944079 
ISSN: 1088-9051 

* the whole document * 

SAGERSTROEM C G ET AL: "SUBTRACTIVE 
CLONING: PAST, PRESENT, AND FUTURE" 
ANNUAL REVIEW OF BIOCHEMISTRY, PALTO ALTO, 
CA, US, 

vol. 66, 1997, pages 751-783, XP000990144 
ISSN: 0066-4154 

* figure 7A * 

CARNINCI P ET AL: "HIGH EFFICIENCY 

SELECTION OF FULL-LENGTH CDNA BY IMPROVED 

BIOTINYLATED CAP TRAPPER" 

DNA RESEARCH, UNIVERSAL ACADEMY PRESS, JP, 

vol. 4, no. 1, 1997, pages 61-66, 

XP002915968 

ISSN: 1340-2838 

* the whole document * 

WO 96 21743 A (AMBION INC ;G0LDRICK 
MARIANNA M (US)) 18 July 1996 (1996-07-18) 



* the whole document * 



-/- 



The present search report has been drawn up (or all claims 



1-88 



C12N15/10 
C12Q1/68 



1-7, 
12-26, 
31-41, 
46, 

49-55, 
68-76, 
86-88 



12-14, 
31, 
46-48,66 



8-11, 

27-30, 

42-45, 

47,48, 

56-67, 

77-85 



TECHNICAL FIELDS 
SEARCHED <lnt.CL7) 



C12N 
C12Q 



Aaoa of *«aieh 

THE HAGUE 



Oat* d completion ot tvm Marc* 

6 June 2003 



Aslund, J 



CATEGORY OF CITED DOCUMENTS 

X : particularly relevant U taken alone 

Y : particularly relevant it combined with another 

document or tne same category 
A : technological background 
O : non-wrilten disclosure 
P:i 



T : theory or principle underlying the invention 
E : earlier patent document, but published on, or 

after the Ming date 
O : document cited in the application 
L : aocumem area tor otner reasons 



& : member of the s 
document 



» patent family, corresponding 



JSOCCIO <EP 1197S52A3 I > 



2 



EP1 197 552 A3 




European Patent 
Office 



EUROPEAN SEARCH REPORT 



Application Number 

EP 01 12 0108 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation ol document with indication, where appropriate, 
of relevant passages 



Relevant 
to claim 



CLASSIFICATION OF THE 
APPLICATION (lnLCl.7) 



SHIBATA Y ET AL: "REPRESENTATIVE 
NORMALIZED AND SUBSTRACED DNA LIBRARIES 
FOR THE CONSTRUCTION OF THE FULL-LENGTH 
CDNA ENCYCLOPEDIA" 

NIPPON BUNSHI SEIBUTSU GAKKAI NENKAI 

PROGRAM KOUEN YOUSHI SHUU , XX; XX, 

vol. 1P-565, 1998, page 322 XP002907992 



TECHNICAL FIELDS 
SEARCHED (lnt.Cl.7) 



The present search report has been drawn up for aO claims 



Place oi search 

THE HAGUE 



Date ol completion of the March 

6 June 2003 



Aslund, J 



CATEGORY OF CITED DOCUMENTS 

X : particularly relevant if taken alone 

Y : particularly relevant if combined with another 

document of the samo category 
A . technological background 
O i non— written dsclosure 
P : intermediate document 



T : theory or principle underlying the invention 
6 ; earlier patent document but published on. or 

after the fifing dale 
O : document cited in the application 
L : document cied for other reasons 



& : member ot the same patent f amity, corresponding 
document 



cnnrirv -co « tmcoiti > - 



3 



EP 1 197 552 A3 



ANNEX TO THE EUROPEAN SEARCH REPORT 
ON EUROPEAN PATENT APPLICATION NO. 



EP 01 12 0108 



This annex lists the patent family members relating to the patent documents cited in the above-mentioned European search report. 
The members are as contained in the European Patent Office EDP We on 

The European Patent Office is in no way liable tor these particulars which are merely given for the purpose of information. 

06-06-2003 



Patent document 
cited in search report 


Publication 
date 


Patent family 
member(s) 


Publication 
date 


WO 9621743 A 


18-07-1996 


US 


5891629 A 


06-04-1999 






AU 


4745896 A 


31-07-1996 






WO 


9621743 Al 


18-07-1996 



p 

Sj For more details about this annex : see Official Journal of the European Patent Office, No. 12/82 



MSOOC1D <EP 1197S52A3 1 > 



4 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 
LM FADED TEXT OR DRAWING 
[^/BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXfflBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked* please do not report these problems to 
the IFW Image Problem Mailbox. 



THIS PAGE BLANK 



9JSPT0) 



